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SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: Hadlaczky, Gyula 
Szalay, Aladar 

(ii) TITLE OF INVENTION: ARTIFICIAL CHROMOSOMES, USES THEREOF AND 
METHODS PREPARING ARTIFICIAL CHROMOSOMES 

(iii) NUMBER OF SEQUENCES: 12 



(iv) 
(A 
(B 
(C 
(D 
(E 
(F 

(v) 



(A 
(B 
(C 
(D 

(vi) 
(A 
(B 
(C 

( vii 
(A 
(B 
(C 

(vii. 
(A 
(B 
(C 

(ix) 
(A 
(B 
(C 



CORRESPONDENCE ADDRESS : 
ADDRESSEE: Brown, Martin, Haller & McClain 
STREET: 1660 Union Street 
CITY: San Diego 
STATE : CA 
COUNTRY: USA 
ZIP: 92101-2926 

COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 

CURRENT APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
CLASSIFICATION: 



PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 08/629,822 
FILING DATE: 10-APR-1996 
CLASSIFICATION : 

) ATTORNEY/AGENT INFORMATION: 
NAME: Seidraan, Stephanie L 
REGISTRATION NUMBER: 33,779 
REFERENCE /DOCKET NUMBER: 6869-40 2A 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 619-238-0999 
TELEFAX: 619-238-0062 
TELEX : 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 



(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

G AAT T CAT C A TTTTTCANGT CCTCAAGTGG ATGTTTCTCA TTTNCCATGA TTTTAAGTTT 6 0 

TCTCGCCATA TTCCTGGTCC TACAGTGTGC ATTTCTCCAT TTTNCACGTT TTNCAGTGAT 12 0 

TTCGTCATTT TCAAGTCCTC AAGTGGATGT TTCTCATTTN CCATGAATTT CAGTTTTCTN 18 0 

GCCATATTCC ACGTCCTACA GNGGACATTT CTAAATTTNC CACCTTTTTC AGTTTTCCTC 24 0 

GCCATATTTC ACGTCCTAAA ATGTGTATTT CTCGTTTNCC GTGATTTTCA GTTTTCTCGC 3 00 

CAGATTCCAG GTCCTATAAT GTGCATTTCT CATTTNNCAC GTTTTTCAGT GATTTCGTCA 3 60 

TTTTTTCAAG TCGGCAAGTG GATGTTTCTC ATTTNCCATG ATTTNCAGTT TTCTTGNAAT 42 0 

ATTCCATGTC CTACAATGAT CATTTTTAAT TTTCCACCTT TTCATTTTTC CACGCCATAT 4 80 

TTCATGTCCT AAAGTGTATA TTTCTCCTTT TCCGCGATTT TCAGTTTTCT CGCCATATTC 54 0 

CAGGTCCTAC AGTGTGCATT CCTCATTTTT CACCTTTTTC ACTGATTTCG TCATTTTTCA 600 

AGTCGTCAAC TGGATCTTTC TAATTTTCCA TGATTTTCAG TTATCTTGTC ATATTCCATG 660 

TCCTACAGTG GACATTTCTA AATTTTCCAA CTTTTTCAAT TTTTCTCGAC ATATTTGACG 72 0 

TGCTAAAGTG TGTATTTCTT ATTTTCCGTG ATTTTCAGTT TTCTCGCCAT ATTCCAGGTC 780 

CTAATAGTGT GCATTTCTCA TTTTTCACGT TTTTCAGTGA TTTCGTCATT TTTTCCAGTT 84 0 

GTCAAGGGGA TGTTTCTCAT TTTCCATGAG TGTCAGTTTT CTTGCTATAT TCCATGTCCT 90 0 

ACAGTGACAT TTCTAAATAT TATACCTTTT TCAGTTTTTC T C AC CAT AT T TCACGTCCTA 96 0 

AAGTATATAT TTCTCATTTT CCCTGATTTT CAGTTTCCTT GCCATATTCC AGGTCCTACA 102 0 

GTGTGCATTT CTCATTTTTC ACGTTTTTCA GTAATTTCTT CATTTTTTAA GCCCTCAAAT 108 0 

GGATGTTTCT CATTTTCCAT GATTTTCAGT TTTCTTGCCA TATACCATGT CCTACAGTGG 114 0 

ACATTTCTAA ATTATCCACC TTTTTCAGTT TTTCATCGGC ACATTTCACG TCCTAAAGTG 12 0 0 

TGTATTTCTA ATTTTCAGTG ATTTTCAGTT TTCTCGCCAT ATTCCAGGAC CTACAGTGTG 12 6 0 

CATTTCTCAT TTTTCACGTT TTTCAGTGAA TTC 1293 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

AGGCCTATGG TGAAAAAGGA AATATCTTCC CCTGAAAACT AGACAGAAGG ATTCTCAGAA 6 0 

TCTTATTTGT GATGTGCGCC CCTCAACTAA CAGTGTTGAA GCTTTCTTTT G ATAG AG C AG 12 0 

TTTTGAAACA CTCTTTTTGT AAAATCTGCA AGAGGATATT TGGATAGCTT TGAGGATTTC 18 0 

CGTTGGAAAC GGGATTGTCT TCATATAAAC CCTAGACAGA AGCATTCTCA GAAGCTTCAT 24 0 

TGGGATGTTT CAGTTGAAGT CACAGTGTTG AACAGTCCCC TTTCATAGAG CAGGTTTGAA 30 0 

ACACTCTTTT TTGT AGTAT C TGGAAGTGGA CATTTGGAGC GATCTCAGGA CTGCGGTGAA 360 

AAAGGAAATA TCTTCCAATA AAAGCTAGAT AGAGGCAATG TCAGAAACCT TTTT CATGAT 42 0 

GTATCTACTC AGCTAACAGA GTTGAACCTT CCTTTGAGAG AGCAGTTTTG AAACACTCTT 48 0 

TTTGTGGAAT CTGCAAGTGG ATATTTGTCT AGCTTTGAGG ATTTCGTTGG GAAACGGGAT 54 0 

TACATATAAA AAGCAGACAG CAGCATTCCC AGAAACTTCT TTGTGATGTT TGCATTCAAG 60 0 

TCACAGAGTT GAACATTCCC TTTCATAGAG CAGGTTTGAA ACACACTTTT TGATGTATCT 66 0 

GGATGTGGAC ATTTGCAGCG CTTTCAGGCC TAAGGTGAAA AGGAAATATC TTCCCCTGAA 72 0 



AACTAGACAG AAGCATTCTC AGAAACTTAT TTGTGATGTG CGCCCTCAAC TAACAGTGTT 78 0 

GAAGCTTTCT TTTGATAGAG GCAGTTTTGA AACACT CTTT TGTGGAATCT GCAAGTGGAT 84 0 

ATTTGTCTAG CTTTGAGGAT TTCTTTGGAA ACGGGATTAC ATATAAAAAG CAGACAGCAG 900 

CATTCCCAGA ATCTTGTTTG TGATGTTTGC ATTCAAGTCA CAGAGTTGAA CATTCCCTTT 96 0 

CAGAGAGCAG GTTTGAACAC TCTTTTTATA GTATCTGGAT GTGGACATTT GGAGCGCTTT 102 0 

CAGGGGGGAT CCTCTAGAAT TCCT 1044 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTGCAGCTGG GGGTCTCCAA TCAGGCAGGG GCCCCTTACT ACT CAGATGG GGTGGCCGAG 6 0 

TAGGGGAAGG GGGTGCAGGC TGCATGAGTG GACACAGCTG TAGGACTACC TGGGGGCTGT 12 0 

GGATCTATGG GGGTGGGGAG AAGCCCAGTG ACAGTGCCTA GAAGAGACAA GGTGGCCTGA 18 0 

GAGGGTCTGA GGAACATAGA GCTGGCCATG TTGGGGCCAG GTCTCAAGCA GGAAGTGAGG 24 0 

AATGGGACAG GCTTGAGGAT ACTCTACTCA GTAGCCAGGA TAGCAAGGAG GGCTTGGGGT 3 00 

TGCTATCCTG GGGTTCAACC CCCCAGGTTG AAGGCCCTGG GGGAGATGGT CCCAGGACAT 360 

ATTACAATGG ACACAGGAGG TTGGGACACC TGGAGTCACC AAACAAAACC ATGCCAAGAG 42 0 

AGACCATGAG TAGGGGTGTC CAGTCCAGCC CTCTGACTGA GCTGCATTGT TCAAATCCAA 4 80 

AGGGCCCCTG CTGCCACCTA GTGGCTGATG G CAT C C AC AT GACCCTGGGC CACACGCGTT 54 0 

TAGGGTCTCT GTGAAGACCA AGATCCTTGT TACATTGAAC GACTCCTAAA TGAGCAGAGA 60 0 

TTTCCACCTA TTCGAAACAA TCACATAAAA TCCATCCTGG AAAAAGCCTG GGGGATGGCA 66 0 

CTAAGGCTAG GGATAGGGTG GGATGAAGAT TATAGTTACA GTAAGGGGTT TAGGGTTAGG 72 0 

GATCAACGTT GGTTAGGAGT TAGGGATACA GTAGGGTACC GGTAGGGTTA GGGGTTAGGG 78 0 

TTAGGGGTTA GGGTTAGGGT TAGGGTTAGG GTTAGGGTTA GGGGTTAGGG GTTAGGGTTA 84 0 

GGGTTAGGTT TTGGGGTGGC GTATTTTGGT CTTATACGCT GTGTTCCACT GGCAATGAAA 90 0 

AGAGTTCTTG TTTTTCCTTC AGCAATTTGT CATTTTTAAA AGAGTTTAGC AATTCTAACA 96 0 

GATATAGACC AGCTGTGCTA TCTCATTGTG GTTTTCAATT GTAACCACAT TGTGGTTTCA 102 0 

ATGTGTTTAC TTGCCATCTG TAGATCTTCT TTGCGTGAGG TGTCTGTTCA GATGTGTGTG 108 0 

CATTTCTTGN NTTTNGGCTG TTTAACTTAT TGTTTAGTTT TAATAATTTT TTATATATTT 114 0 

GAAGACAAAT CTTTCTCAGA TGTGTATTTG CAAATATTTC TTCAATATGA GGCTTGCTTT 120 0 

TGTCTCTAAC AAGGTCTCTT CAGAGATAAC TTAAATATAA GAAATCCACA CTGTCACTTC 126 0 

TTTTGTGTAT ATCTACCTTT TGTGTCATTT GTTAAAATTC ATTACCAAAC CCAAAGGCAG 132 0 

ATAGCTTTTC TTCTATTGTT TCTTCTAGAA ATTTGTATAG TTTTGCATTT TTAGTGTAAG 138 0 

GATGATTTTG AGTGATTATT TGTGTAAGTT GTAAAGTTTT CGTCTATATC CAT AT C AT T T 144 0 

CTTATGGTTT CCAATTAATC GTTCCCTCAC TATTTTTGGG AAAGACACAG GATAGTGGGC 150 0 

TTTGTTAGAG TAGATAGGTA GCTAGACATG AACAGGAGGG GGCCTCCTGG AAAAGGGAAA 156 0 

GTCTGGGAAG GCTCACCTGG AGGACCACCA AAAATT CACA TATTAGTAGC ATCTCTAGTG 162 0 

CTGGAGTGGA TGGGCACTTG TCAATTGTGG GTAGGAGGGA AAAGAGGTCC TATGCAGAAA 168 0 

GAAACTCCCT AGAACTCCTC TGAAGATGCC CCAATCATTC ACTCTGCAAT AAAAATGTCA 174 0 

GAATATTGCT AGCTACATGC TGATAAGGNN AAAGGGGACA TTCTTAAGTG AAACCTGGCA 180 0 

CCATAAGTAC AGATTAGGGC AGAGAAGGAC ATTCAAAAGA GGCAGGCGCA GTAGGTACAA 186 0 

ACGTGATCGC TGTCAGTGTG CCTGGGATGG CGGGAAGGAG GCTGGTGCCA GAGTGGATTC 192 0 



GTATTGATCA CCACACATAT ACCTCAACCA ACAGTGAGGA GGTCCCACAA GCCTAAGTGG 1980 

GGCAAGTTGG GGAGCTAAGG CAGTAGCAGG AAAACCAGAC AAAGAAAACA GGTGGAGACT 2 04 0 

TGAGACAGAG GCAGGAATGT GAAGAAATCC AAAATAAAAT TCCCTGCACA GGACTCTTAG 210 0 

GCTGTTTAAT GCATCGCTCA GTCCCACTCC TCCCTATTTT TCTACAATAA ACTCTTTACA 216 0 

CTGTGTTTCT TTTCAATGAA GTTATCTGCC ATCTTTGTAT TGCCTCTTGG TGAAAATGTT 222 0 

TCTTCCAAGT TAAACAAGAA CTGGGACATC AGCTCTCCCC AGTAATAGCT CCGTTTCAGT 22 8 0 

TTGAATTTAC AGAACTGATG GGCTTAATAA CTGGCGCTCT GACTTTAGTG GTGCAGGAGG 234 0 

CCGTCACACC GGGACCAAGA GTGCCCTGCC TAGTCCCCAT CTGCCCGCAG GTGGCGGCTG 2400 

CCTCGACACT GACAGCAATA GGGTCCGGCA GTGTCCCCAG CTGCCAGCAG GGGGCGTACG 24 60 

ACGACTACAC TGTGAGCAAG AGGGCCCTGC AG 24 92 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GGGGAATTCA TTGGGATGTT TCAGTTGA 2 8 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CGAAAGTCCC CCCTAGGAGA TCTTAAGGA 2 9 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: RNA 



(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CCGCTTAATA CTCTGATGAG TCCGTGAGGA CGAAACGCTC TCGCACC 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CGATTTAAAT TAATTAAGCC CGGGC 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
TAAATTTAAT TAATTCGGGC CCGTCGA 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(D) OTHER INFORMATION IL-2 signal sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATG TAC AGG ATG CAA CTC CTG TCT TGC ATT GCA CTA AGT CTT GCA CTT 4 8 

Met Tyr Arg Met Gin Leu Leu Ser Cys lie Ala Leu Ser Leu Ala Leu 

GTC ACA AAC AGT GCA CCT ACT 6 9 

Val Thr Asn Ser Ala Pro Thr 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...942 

(D) OTHER INFORMATION: Renilla Reinformis Lucif erase 

(x) PUBLICATION INFORMATION: 

PATENT NO. : 5,418,155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AGC TTA AAG ATG ACT TCG AAA GTT TAT GAT CCA GAA CAA AGG AAA CGG 48 
Ser Leu Lys Met Thr Ser Lys Val Tyr Asp Pro Glu Gin Arg Lys Arg 
1 5 10 15 

ATG ATA ACT GGT CCG CAG TGG TGG GCC AGA TGT AAA CAA ATG AAT GTT 96 
Met lie Thr Gly Pro Gin Trp Trp Ala Arg Cys Lys Gin Met Asn Val 
2 0 2b " 3 0 

CTT GAT TCA TTT ATT AAT TAT TAT GAT TCA GAA AAA CAT GCA GAA AAT 144 
Leu Asp Ser Phe lie Asn Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn 
35 4 0 45 

GCT GTT ATT TTT TTA CAT GGT AAC GCG GCC TCT TCT TAT TTA TGG CGA 192 
Ala Val lie Phe Leu His Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg 
50 55 60 

CAT GTT GTG CCA CAT ATT GAG CCA GTA GCG CGG TGT ATT ATA CCA GAT 240 
His Val Val Pro His lie Glu Pro Val Ala Arg Cys lie lie Pro Asp 
65 70 75 80 

CTT ATT GGT ATG GGC AAA TCA GGC AAA TCT GGT AAT GGT TCT TAT AGG 2 88 

Leu lie Gly Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg 

85 ~ 90 95 



TTA CTT GAT 
Leu Leu Asp 



TAC CAA AGA 
Tyr Gin Arg 
115 

GCA TTT CAT 
Ala Phe His 
130 

CAC GCT GAA 
His Ala Glu 
145 

GAT ATT GAA 
Asp lie Glu 



ATG GTT TTG 
Met Val Leu 



ATC ATG AGA 
lie Met Arg 
195 

TTC AAA GAG 
Phe Lys Glu 
210 

GAA ATC CCG 
Glu lie Pro 
225 

AGG AAT TAT 
Arg Asn Tyr 



TTT ATT GAA 
Phe lie Glu 



GCC AAG AAG 
Ala Lys Lys 
275 

TTT TCG CAA 
Phe Ser Gin 
290 

TTC GTT GAG 
Phe Val Glu 
305 



CAT TAC AAA 
His Tyr Lys 
100 

AGA TCA TTT 
Arg Ser Phe 



TAT AGC TAT 
Tyr Ser Tyr 



AGT GTA GTA 
Ser Val Val 
150 

GAA GAT ATT 
Glu Asp lie 
165 

GAG AAT AAC 
Glu Asn Asn 
180 

AAG TTA GAA 
Lys Leu Glu 



AAA GGT GAA 
Lys Gly Glu 



TTA GTA AAA 
Leu Val Lys 
230 

AAT GCT TAT 
Asn Ala Tyr 
245 

TCG GAT CCA 
Ser Asp Pro 
260 

TTT CCT AAT 
Phe Pro Asn 



GAA GAT GCA 
Glu Asp Ala 



CGA GTT CTC 
Arg Val Leu 
310 



TAT CTT ACT 
Tyr Leu Thr 
105 

TTT GTC GGC 
Phe Val Gly 
120 

GAG CAT CAA 
Glu His Gin 
135 

GAT GTG ATT 
Asp Val lie 



GCG TTG ATC 
Ala Leu lie 



TTC TTC GTG 
Phe Phe Val 
185 

CCA GAA GAA 
Pro Glu Glu 
200 

GTT CGT CGT 
Val Arg Arg 
215 

GGT GGT AAA 
Gly Gly Lys 



CTA CGT GCA 
Leu Arg Ala 



GGA TTC TTT 
Gly Phe Phe 
265 

ACT GAA TTT 
Thr Glu Phe 
280 

CCT GAT GAA 
Pro Asp Glu 
295 

AAA AAT GAA 
Lys Asn Glu 



GCA TGG TTG 
Ala Trp Leu 



CAT GAT TGG 
His Asp Trp 



GAT AAG ATC 
Asp Lys lie 
140 

GAA TCA TGG 
Glu Ser Trp 
155 

AAA TCT GAA 
Lys Ser Glu 
170 

GAA ACC ATG 
Glu Thr Met 



TTT GCA GCA 
Phe Ala Ala 



CCA ACA TTA 
Pro Thr Leu 
220 

CCT GAC GTT 
Pro Asp Val 
235 

AGT GAT GAT 
Ser Asp Asp 
250 

TCC AAT GCT 
Ser Asn Ala 



GTC AAA GTA 
Val Lys Val 



ATG GGA AAA 
Met Gly Lys 
300 

CAA TAA 
Gin 



AAC TTC TTA 
Asn Phe Leu 
110 

GGT GCT TGT 
Gly Ala Cys 
125 

AAA GCA ATA 
Lys Ala lie 



GAT GAA TGG 
Asp Glu Trp 



GAA GGA GAA 
Glu Gly Glu 
175 

TTG CCA TCA 
Leu Pro Ser 
190 

TAT CTT GAA 
Tyr Leu Glu 
205 

TCA TGG CCT 
Ser Trp Pro 



GTA CAA ATT 
Val Gin lie 



TTA CCA AAA 
Leu Pro Lys 
255 

ATT GTT GAA 
lie Val Glu 
270 

AAA GGT CTT 
Lys Gly Leu 
285 

TAT ATC AAA 
Tyr lie Lys 



ATT 3 3 6 

He 



TTG 3 84 

Leu 



GTT 432 
Val 



CCT 4 80 

Pro 

160 

AAA 528 
Lys 



AAA 576 
Lys 



CCA 624 
Pro 



CGT 6 72 

Arg 



GTT 72 0 

Val 

240 

ATG 768 
Met 



GGC 816 
Gly 



CAT 8 64 

His 



TCG 912 
Ser 



945 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TTTGAATTC A TGTACAGGAT GCAACTCCTG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 



TTTGAATTCA GTAGGTGCAC TGTTTGTCAC 



6869-402A 



ARTIFICIAL CHROMOSOMES, USES THEREOF AND METHODS FOR 
PREPARING ARTIFICIAL CHROMOSOMES 

RELATED APPLICATIONS 

This application is a continuation of U.S. application Serial No. 
5 08/629,822, filed April 10, 1996. This application is related to U.S. 
application Serial No. 08/759,558, now U.S. Patent No. 5,288,625 and 
to U.S. application Serial No. 08/375,271, filed 1/19/95, which is a 
continuation of U.S. application Serial No. 08/080,097, filed 6/23/93 
which is a continuation of U.S. application Serial No. 07/892,487, filed 

10 6/3/92, which is a continuation of U.S. application Serial No. 
07/521,073, filed 5/9/90. 

The subject matter of each of the above-listed U.S. applications is 
incorporated in its entirety by reference thereto. 
FIELD OF THE INVENTION 

15 The present invention relates to methods for preparing cell lines 

that contain artificial chromosomes, to methods for isolation of the arti- 
ficial chromosomes, targeted insertion of heterologous DNA into the 
chromosomes, isolation of the chromosomes, and delivery of the chromo- 
somes to selected cells and tissues. Also provided are cell lines for use in 

20 the methods, and cell lines and chromosomes produced by the methods. 
BACKGROUND OF THE INVENTION 

Several viral vectors, non-viral, and physical delivery systems for 
gene therapy have been developed [see, e.g. , Mitani et aL (1 993) Trends 
Biotech. 11 :162-166]. The presently available systems, however, have 

25 numerous limitations, particularly where persistent, stable, or controlled 
gene expression is required. These limitations include: (1) size limitations 
because there is a limit, generally on order of about ten kilobases [kB], at 
most, to the size of the DNA insert [gene] that can be accepted by viral 
vectors, whereas a number of mammalian genes of possible therapeutic 

30 importance are well above this limit, especially if all control elements are 
included; (2) the inability to specifically target integration so that random 
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integration is required which carries a risk of disrupting vital genes or 
cancer suppressor genes; (3) the expression of randomly integrated 
therapeutic genes may be affected by the functional compartmentalization 
in the nucleus and are affected by chromatin-based position effects; (4) 
5 the copy number and consequently the expression of a given gene to be 
integrated into the genome cannot be controlled. Thus, improvements in 
gene delivery and stable expression systems are needed [see, e.g. . 
Mulligan (1993) Science 260 :926-932]. 

In addition, safe and effective gene therapy methods and vectors 

10 should have numerous features that are not assured by the presently 
available systems. For example, a safe vector should not contain DNA 
elements that can promote unwanted changes by recombination or 
mutation in the host genetic material, should not have the potential to 
initiate deleterious effects in cells, tissues, or organisms carrying the 

15 vector, and should not interfere with genomic functions. In addition, it 
would be advantageous for the vector to be non-integrative, or designed 
for site-specific integration. Also, the copy number of therapeutic gene(s) 
carried by the vector should be controlled and stable, the vector should 
secure the independent and controlled function of the introduced gene(s); 

20 and the vector should accept large (up to Mb size) inserts and ensure the 
functional stability of the insert. 

The limitations of existing gene delivery technologies, however, 
argue for the development of alternative vector systems suitable for 
transferring large [up to Mb size or larger] genes and gene complexes 

25 together with regulatory elements that will provide a safe, controlled, and 
persistent expression of the therapeutic genetic material. 

At the present time, none of the available vectors fulfill these 
requirements. Some of these characteristics, however, are possessed by 
chromosomes. Thus, an artificial chromosome would be an ideal vector 

30 for gene therapy, as well as for production of gene products that require 
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coordination of expression of numerous genes or that are encoded by 
large genes, and other uses. Artificial chromosomes for expression of 
heterologous genes in yeast are available, but construction of a 
mammalian artificial chromosome has not been achieved. Such 
5 construction has been hindered by the lack of an isolated, functional, 
mammalian centromere and uncertainty regarding the requisites for its 
production and stable replication. Unlike in yeast, there are no selectable 
genes in close proximity to a mammalian centromere, and the presence of 
long runs of highly repetitive pericentric heterochromatic DNA makes the 

10 isolation of a mammalian centromere using presently available methods, 
such as chromosome walking, virtually impossible. Other strategies are 
required for production of mammalian artificial chromosomes, and some 
have been developed. For example, U.S. Patent No. 5,288,625 provides 
a cell line that contains an artificial chromosome, a minichromosome, that 

15 is about 20 to 30 megabases. Methods provided for isolation of these 
chromosomes, however, provide preparations of only about 10-20% 
purity. Thus, development of alternative artificial chromosomes and 
perfection of isolation methods as well as development of more versatile 
chromosomes and further characterization of the minichromosomes is 

20 required to realize the potential of this technology. 

Therefore, it is an object herein to provide mammalian artificial 
chromosomes and methods for introduction of foreign DNA into such 
chromosomes. It is also an object herein to provide methods for 
introduction of the artificial mammalian chromosome into selected cells, 

25 and to provide the resulting cells, as well as transgenic animals and plants 
that contain the artificial chromosomes. It is also an object herein to 
provide methods for gene therapy and expression of gene products using 
artificial chromosomes. It is a further object herein to provide methods 
for constructing species-specific artificial chromosomes. 
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SUMMARY OF THE INVENTION 

Mammalian artificial chromosomes [MACs] are provided. Also 
provided are artificial chromosomes for other higher eukaryotic species, 
such as insects and fish, produced using the MACS are provided herein. 
5 Methods for generating and isolating such chromosomes. Methods using 
the MACs to construct artificial chromosomes from other species, such as 
insect and fish species are also provided. The artificial chromosomes are 
fully functional stable chromosomes. Two types of artificial 
chromosomes are provided. One type, herein referred to as SATACs 

10 [satellite artificial chromosomes] are stable heterochromatic 

chromosomes, and the another type are minichromosomes based on 
amplification of euchromatin. 

Artificial chromosomes permit targeted integration of megabase pair 
size DNA fragments that contain single or multiple genes. Thus methods 

15 using the MACs to introduce the genes into cells, animals and tissues are 
also provided. The artificial chromosomes with integrated heterologous 
DNA are to be used in methods of gene therapy, in methods of 
production of gene products, particularly products that require expression 
of multigene biosynthetic pathways, and also are intended for delivery 

20 into the nuclei of germline cells, such as embryo-derived stem cells [ES 
cells] for production of transgenic animals. 

Mammalian artificial chromosomes provide extra-genomic specific 
integration sites for introduction of genes encoding proteins of interest 
and permit megabase size DNA integration so that, for example, genes 

25 encoding an entire metabolic pathway or a very large gene, such as the 
cystic fibrosis [CF; ~600kb] gene, several genes, such as a series of 
antigens for preparation of a multivalent vaccine, can be stably introduced 
into a cell. Vectors for targeted introduction of such genes, including the 
tumor suppressor genes, such as p53, the cystic fibrosis transmembrane 
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regulator gene [CFTR], anti-HIV ribozymes, such as an anti-HIV gag 
ribozyme, into the artificial chromosomes also provided. 

The chromosomes provided herein are generated by introducing 
heterologous DNA that includes DNA encoding a selectable marker into 
5 cells, preferably a stable cell line, growing the cells under selective 
conditions, and identifying from among the resulting clones those that 
include chromosomes with more than one centromere or that have 
chromosomes that are fragments of chromosomes that had more than 
one centromere. The amplification that produces the additional 

10 centromere occurs in cells that contain chromosomes in which the 

heterologous DNA has integrated near the centromere in the pericentric 
region of the chromosome. The selected clonal cells are then used to 
generate artificial chromosomes. 

In preferred embodiments, the DNA with the selectable marker that 

15 is introduced includes sequences that target it to the pericentric region of 
the chromosome. For example, vectors, such as pTEMPUD, which 
includes such DNA specific for mouse satellite DNA, are provided. Also 
provided are derivatives of pTEMPUD that specifically target human 
satellite sequences. Upon integration, these vectors can induce the 

20 amplification. 

Artificial chromosomes are generated by culturing the cells with the 
dicentric chromosomes under conditions whereby the chromosome breaks 
to form a minichromosome and formerly dicentric chromosome. The 
artificial chromosomes [the SATACs] are generated, not from the 

25 minichromosome fragment as, for example, in U.S. Patent No. 

5,288,625, but from the fragment of the formerly dicentric chromosome. 
Among the MACs provided herein are the SATACs, which are 
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primarily made up of repeating units of short satellite DNA and are fully 
heterochromatic, so that absent insertion of heterologous or foreign DNA, 
the chromosomes do not contain genetic information. They can thus be 
used as "safe" vectors for delivery of DNA to mammalian hosts because 
they do not contain any potentially harmful genes. 

In addition to MACs methods for generating euchromatic 
minichromsomes and the use thereof are also provided herein. Methods 
for generating one type of MAC the minichromosome, previously 
described in U.S. Patent No. U.S. Patent No. 5,288,625, and the use 
thereof for expression of heterologous DNA are provided. Cell lines 
containing the minichromosome and the use thereof for cell fusion are 
also provided. 

In one embodiment, a cell line containing the mammalian 
minichromosome is used as recipient cells for donor DNA encoding a 
selected gene or multiple genes. The donor DNA is linked to a second 
selectable marker and is targeted to and integrated into the 
minichromosome. The resulting chromosome is transferred by cell fusion 
into an appropriate recipient cell line, such as a Chinese hamster cell line 
[CHO]. After large scale production of the cells carrying the engineered 
chromosome, the chromosome is isolated. In particular, metaphase 
chromosomes are obtained, such as by addition of colchicine, and they 
are purified from the cell lysate. These chromosomes are used for 
cloning, sequencing and for delivery of heterologous DNA into cells. 

Also provided are SATACs of various sizes that are formed by 
repeated culturing under selective conditions and subcloning of cells that 
contain chromosomes produced from the formerly dicentric 
chromosomes. These chromosomes are based on repeating units 7.5 to 
10 Mb referred to herein as megareplicons, that are tandem blocks of 
satellite DNA flanked by heterologous non-satellite DNA. Amplification 
produces a tandem array of identical chromosome segments [each called 
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an amplicon] that contain two inverted megareplicons bordered by 
heterologous ["foreign"] DNA. Repeated cell fusion, growth on selective 
medium and/or BrdU [5-bromodeoxyuridine] treatment or other genome 
destabilizing reagent or agent, such as ionizing radiation, including X-rays, 
and subcloning results in cell lines that carry stable heterochromatic or 
partially heterochromatic chromosomes, including a 1 50-200 Mb 
"sausage" chromosome, a 500-1000 Mb gigachromosome, a stable 250- 
400 Mb megachromosome and various smaller stable chromosomes 
derived therefrom. These chromosomes are based on these repeating 
units and can include heterologous DNA that is expressed. 

Thus methods for producing MACs of both types are provided. 
These methods are applicable to any higher eukaryotic cell, including 
mammals, insects and plants. 

The resulting chromosomes can be purified by methods provided 
herein to provide vectors for introduction of the heterologous DNA into 
selected cells for production of the gene product encoded by the 
heterologous DNA, for production of transgenic animals and plants or for 
gene therapy. Vectors for chromosome fragmentation are provided. 
These vectors will be used to produce an artificial chromosome that 
contains a megareplicon, a centromere and two telomeres and will be 
between about 10 Mb and about 60 Mb, preferably between about 10 
Mb-15 Mb and 30 Mb. Such artificial chromosomes may be produced by 
other methods. Isolation of the 7.5 Mb [or 15 Mb amplicon containing 
two 7.5 Mb inverted repeats] or a 30 Mb multimer thereof should provide 
a stable chromosomal vector that can be manipulated in vitro . 

In addition, methods and vectors for fragmenting the 
minichromosomes and SATACs are provided. Such methods and vectors 
can be used for in vivo generation of smaller stable artificial 
chromosomes. Methods for reducing the size of the MACs to generate 
smaller stable self-replicating artificial chromosomes are also provided. 
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Methods and vectors for targeting heterologous DNA into the 
artificial chromosomes are also provided as are methods and vectors for 
fragmenting the chromosomes to produce smaller but stable and self- 
replicating artificial chromosomes. Vectors for targeted introduction of 
heterologous DNA into artificial chromosomes are provided, as are 

The chromosomes are introduced into cells to produce stable 
transformed cell lines or cells, depending upon the source of the cells. 
Introduction is effected by any suitable method including, but not limited 
to electroporation, direct uptake, such as by calcium phosphate [see, e.g., 
Wigler et aL (1979) Proc. Natl. Acad. Sci. U.S.A. 76:1373-1376; and 
Current Protocols in Molecular Biology. Vol. 1 , Wiley Inter-Science, 
Supplement 14, Unit 9.1.1-9.1.9 (1990)]. precipitation, uptake of isolated 
chromosomes by lipofection, by microcell fusion [see, EXAMPLES, see, 
also Lambert (1991) Proc. Natl. Acad. Sci. U.S.A. 88:5907-591 1 , U.S. 
Patent No. 5,396,767] or other suitable method. The resulting cells can 
be used for production of proteins in the cells. The chromosomes can be 
isolated and used for gene delivery. 

Methods for isolation of the chromosomes based on the DNA 
content of the chromosomes, which differs from the authentic 
chromsomes are provided. 

These artificial chromosomes can be used in gene therapy, gene 
product production systems, production of humanized organs, production 
of transgenic plants and animals, including invertebrates, vertebrate, 
reptiles and insects, any organism or device that would employ 
chromosomal elements as information storage vehicles, and also for 
analysis and study of centromere function, for the production of artificial 
chromosome vectors that can be constructed in vitro , and for the 
preparation of species- specific artificial chromosomes. The artificial 
chromosomes can be introduced into cells using microinjection, cell 
fusion, microcell fusion, electroporation, electrofusion, projectile 
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bombardment, calcium phosphate precipitation, site-specific targeting and 
other such methods. Cells particularly suited for use with the artificial 
chromosomes include, but are not limited to plant cells, particularly 
tomato, arabidopsis, and others, insect cells, including silk worm cells, 
5 insect larvae, fish, reptiles, amphibians, arachnids, mammalian cells, 
embryonic stem cells, embryos and cells for use in methods of genetic 
therapy, such as lymphocytes that are used in methods of adoptive 
immunotherapy and nerve or neural cells. Thus methods of producing 
gene products and transgenic animals and plants are provided. Also 
10 provided are the resulting transgenic animals and plants. 

Exemplary cell lines that contain these chromosomes are also 
provided. 

Methods for preparing artificial chromosomes for particular species 
and for cloning centromeres are also provided. In particular, a method for 

15 cloning a centromere from an animal or plant by preparing a library of 
DNA fragments that contain the genome of the plant or animal, 
introducing the each of the fragments into a mammalian satellite artificial 
chromosome [SATAC] that contains a centromere from a different 
species, generally a mammal, from the selected plant or animal, generally 

20 a non-mammal, and a selectable marker. The selected plant or animal is 
one in which the mammalian species centromere does not function. Each 
of the SATACs is introduced into the cells, which are grown under 
selective conditions, and cells with SATACs are identified. Such SATACS 
should contain a centromere encoded by the DNA from the library. 

25 Also provided are libraries in which the relatively large fragments of 

DNA are contained on artificial chromosomes. 

Transgenic animals, invertebrates and vertebrates, plants and 
insects, fish, reptiles, amphibians, arachnids and mammals are also 
provided. Of particular interest are transgenic animals that express genes 

30 that confer resistance or reduce susceptibility to disease. Since multiple 



6869-402A 



genes can be introduced on a MAC, a series of genes encoding an 
antigen can be introduced, which up expression will serve to immunize [in 
a manner similar to a multivalent vaccine] the host animal against the 
diseases for which exposure to the antigens provide immunity or some 
5 protection. 

Methods for cloning centromeres, such as mammalian centromeres, 
are also provided. In particular, in one embodiment, a library composed 
of fragments of the SATACs are cloned into YACs [yeast artificial 
chromosomes] that include a detectable marker, such as DNA encoding 
10 tyrosinase and then introduced into mammalian cells, such as albino 
mouse embryos. Mice produced from YACs that include a centromere 
that functions in mammals will express the detectable marker. Thus, if 
albino mice will be pigmented or have regions of pigmentation. 
DESCRIPTION OF THE DRAWINGS 
15 Figure 1 is a schematic drawing depicting formation of the 

MMCneo [the minichromosome] chromosome. A-G represents the 
successive events consistent with observed data that would lead to the 
formation and stabilization of the minichromosome. 

Figure 2 shows a schematic summary of the manner in which the 
20 observed new chromosomes would form, and the relationships among the 
different de novo formed chromosomes. In particular, this figure shows 
a schematic drawing of the de novo chromosome formation initiated in 
the centromeric region of mouse chromosome 7. (A) A single E-type 
amplification in the centromeric region of chromosome 7 generates a neo- 
25 centromere linked to the integrated "foreign" DNA, and forms a dicentric 
chromosome. Multiple E-type amplification forms the A neo-chromosome, 
which was derived from chromosome 7 and stabilized in a mouse-hamster 
hybrid cell line; (B) Specific breakage between the centromeres of a 
dicentric chromosome 7 generates a chromosome fragment with the neo- 
30 centromere, and a chromosome 7 with traces of heterologous DNA at the 
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end; (C) Inverted duplication of the fragment bearing the neo-centromere 
results in the formation of a stable neo-minichromosome; (D) Integration 
of exogenous DNA into the heterologous DNA region of the formerly 
dicentric chromosome 7 initiates H-type amplification, and the formation 
of a heterochromatic arm. By capturing a euchromatic terminal segment, 
this new chromosome arm is stabilized in the form of the "sausage" 
chromosome; (E) BrdU [5-bromodeoxyuridine], treatment and/or drug 
selection induce further H-type amplification, which results in the 
formation of an unstable gigachromosome: (F) Repeated BrdU treatments 
and/or drug selection induce further H-type amplification including a 
centromere duplication, which leads to the formation of another 
heterochromatic chromosome arm. It is split off from the chromosome 7 
by chromosome breakage, and by acquiring a terminal segment, the 
stable megachromosome is formed. 

Figure 3 Schematic diagram of the replicon structure and a scheme 
by which the megachromosome could be produced. 

Figure 4 sets forth the relationships among the some of the 
exemplary cell lines described herein. 

Figure 5 is diagram of the plasmid pTEMPUD. 
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Definitions 

Unless defined otherwise, all technical and scientific terms used 
herein have the same meaning as is commonly understood by one of skill 
in the art to which this invention belongs. All patents and publications 
referred to herein are incorporated by reference. 

As used herein, a mammalian artificial chromosome [MAC] is piece 
of DNA that can stably replicate and segregate alongside endogenous 
chromosomes. It has the capacity to accommodate and express 
heterologous genes inserted therein. It is referred to as a mammalian 
artificial chromosome because it includes an active mammalian 
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centromere. Plant artificial chromosomes and an insect artificial 
chromosomes refer to chromosomes that include plant and insect 
centromeres, respectively. A human specific chromosome [HAC] refers 
to chromosomes that include human centromeres, BUGACs refer to 
artificial insect chromosomes, and AVACs refer to avian artificial 
chromosomes. 

As used herein, stable maintenance of chromosomes, occurs when 
at least about 85%, preferably 90%, more preferably 95%, of the cells 
retain the chromosome. Stability is measured in the presence of selective 
agent. Preferably these chromosomes are also maintained in the absence 
of a selective agent. Stable chromosomes also retain their structure 
during cell culturing, suffering neither intrachromosomal nor 
interchromosomal rearrangements. 

As used herein, growth under selective conditions, means growth 
of a cell under conditions that require expression of a selectable marker 
for survival. 

As used herein, euchromatin and heterochromatin have their 
recognized meanings, euchromatin refers to DNA that contains genes, 
and heterochromatin refers to chromatin that has been thought to be 
inactive. Highly repetitive DNA sequences [satellite DNA] are located in 
regions of centromeric heterochromatin [pericentric heterochromatin]. 
Constitutive heterochromatin refers to heterochromatin that contains the 
highly repetitive DNA and that is constitutively condensed. 

As used herein, BrdU refers to 5-bromodeoxyuridine, which during 
replication is inserted in place of thymidine. BrdU is used as mutagen; it 
also inhibits condensation of metaphase chromosomes during cell 
division. 

As used herein, a dicentric chromosome is a chromosome that 
contains two centromeres. A multicentric chromosome contains more 
than two centromeres. 
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As used herein, a formerly dicentric chromosome is a chromosome 
that is produced when a dicentric chromosome fragments and acquires 
new telomeres so that two chromosomes, each having one of the 
centromeres, are produced. Each of the fragments, are replicable 
5 chromosomes. If one of the chromosomes undergoes amplification of 
euchromatic DNA to produce a full functionally chromosome that contains 
the heterologous DNA and primarily [at least more than 50%] 
euchromatin, it is a minichromosome. The remaining chromosome is a 
formerly dicentric chromosome. If one of the chromosomes undergoes 

10 amplification, whereby heterochromatin [satellite DNA] is amplified, a 
euchromatic portion [or arm remains], it is referred to as a sausage 
chromosome. A chromosome that is substantially all heterochromatin, 
except for portions of heterologous DNA, is called a SATAC. Such 
chromosomes [SATACs] can be produced from sausage chromosomes by 

15 culturing the cell containing the sausage chromosome under conditions, 
such as BrdU treatment and/or growth under selective conditions, that 
destabilize the chromosome so that a satellite artificial chromosomes 
[SATAC] is produced. For purposes herein, it is understand that SATACs 
may not necessarily be produced in multiple steps, but may appear after 

20 the initial introduction of the heterologous DNA and growth under 

selective conditions, or they may appear after several cycles of growth 
under selective conditions and BrdU treatment. 

As used herein an amplicon is the smallest repeated unit that 
contains heterologous DNA in the MACs provided herein. A megareplicon 

25 contains at least one amplicon, an inverted repeat thereof, and a 

megareplicator. A megareplicator is a primary replication initiation site. 

As used herein, the minichromosome refers to a chromosome 
derived from a dicentric chromosome [see, e.g. , FIG. 1] that is contains 
more euchromatic than heterochromatic DNA. 
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As used herein, a megachromosome refers to a chromosome that, 
except for introduced heterologous DNA is substantially composed of 
heterochromatin. Megachromosomes are made of a tandem arry of 
amplicons that containn two inverted megareplicons bordered by 
introduced heterologous DNA [see, e.g. . Figure 3 for a schematic drawing 
of a megachromosome]. For purposes herein, a megachromosome is 
about 50 to 400 Mb, generally about 250-400 Mb. Shorter variants, are 
also referred to a truncated megachromosomes [about 90 to 1 20 or 1 50 
Mb], dwarf megachromosomes [-150-200 Mb] and cell lines, and a 
micro-megachromosome [-60-90 Mb]. For purposes herein, the term 
megachromosome refers to the overall repeated structure based on a 
tandem array of repeated chromosomal segments [amplicons] that contain 
two inverted megareplicons bordered by any inserted heterologous DNA. 
The size will be specified. 

As used herein, genetic therapy involves the transfer of 
heterologous DNA to the certain cells, target cells, of an individual 
afflicted with a disorder for which such therapy is sought. The DNA is 
introduced into the selected target cells in a manner such that the 
heterologous DNA is expressed and a product encoded thereby is 
produced. Alternatively, the heterologous DNA may in some manner 
mediate expression of DNA that encodes the therapeutic product, it may 
encode a product, such as a peptide or RNA that in some manner 
mediates, directly or indirectly, expression of a therapeutic product. 
Genetic therapy may also be used to introduce therapeutic compounds, 
such as TNF, that are not normally produced in the host or that are not 
produced in therapeutically effective amounts or at a therapeutically 
useful time. The heterologous DNA encoding the therapeutic product may 
be modified prior to introduction into the cells of the afflicted host in order 
to enhance or otherwise alter the product or expression thereof. 
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As used herein, heterologous or foreign DNA and RNA are used 
interchangeably and refer to DNA or RNA that does not occur naturally as 
part of the genome in which it is present or which is found in a location 
or locations in the genome that differ from that in which it occurs in 
nature. It is DNA or RNA that is not endogenous to the cell and has been 
exogenously introduced into the cell. Examples of heterologous DNA 
include, but are not limited to, DNA that encodes a gene or gene(s) of 
interest, introduced for purposes of gene therapy or for production of the 
encoded protein. Other examples of heterologous DNA include, but are 
not limited to, DNA that encodes traceable marker proteins, such as a 
protein that confers drug resistance, DNA that encodes therapeutically 
effective substances, such as anti-cancer agents, enzymes and hormones, 
and DNA that encodes other types of proteins, such as antibodies. 
Antibodies that are encoded by heterologous DNA may be secreted or 
expressed on the surface of the cell in which the heterologous DNA has 
been introduced. 

As used herein, a therapeutically effective product is a product that 
is encoded by heterologous DNA that, upon introduction of the DNA into 
a host, a product is expressed that effectively ameliorates or eliminates 
the symptoms, manifestations of an inherited or acquired disease or that 

cures said disease. 

As used herein, transgenic plants refer to plants in which 
heterologous or foreign DNA is expressed or in which the expression of a 
gene naturally present in the plant has been altered. 

As used herein, operative linkage of heterologous DNA to 
regulatory and effector sequences of nucleotides, such as promoters, 
enhancers, transcriptional and translational stop sites, and other signal 
sequences refers to the relationship between such DNA and such 
sequences of nucleotides. For example, operative linkage of heterologous 
DNA to a promoter refers to the physical relationship between the DNA 
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and the promoter such that the transcription of such DNA is initiated from 
the promoter by an RNA polymerase that specifically recognizes, binds to 
and transcribes the DNA in reading frame. 

As used herein, isolated, substantially pure DNA refers to DNA 
fragments purified according to standard techniques employed by those 
skilled in the art, such as that found in Maniatis et al [(1 982) Molecular 
Cloning : A Laboratory Manual , Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY]. 

As used herein, expression refers to the process by which nucleic 
acid is transcribed into mRNA and translated into peptides, polypeptides, 
or proteins. If the nucleic acid is derived from genomic DNA, expression 
may, if an appropriate eukaryotic host cell or organism is selected, include 
splicing of the mRNA. 

As used herein, vector or plasmid refers to discrete elements that 
are used to introduce heterologous DNA into cells for either expression of 
the heterologous DNA or for replication of the cloned heterologous DNA. 
Selection and use of such vectors and plasmids are well within the level 
of skill of the art. 

As used herein, transformation/transfection refers to the process by 
which DNA or RNA is introduced into cells to for gene expression. 
Transfection refers to the taking up of an expression vector by a host cell 
whether or not any coding sequences are in fact expressed. Numerous 
methods of transfection are known to the ordinarily skilled artisan, for 
example, by direct uptake using calcium phosphate [CaP04; see,_e i g_ i , 
Wigler et aL (1 979) Proc. Natl. Acad. Sci. U.S.A. 76:1 373-1 376] and 
polyethylene glycol [PEG]-mediated DNA uptake and electroporation. 
Successful transfection is generally recognized when any indication of the 
operation of this vector occurs within the host cell. Transformation 
means introducing DNA into an organism so that the DNA is replicable, 
either as an extrachromosomal element or by chromosomal integrant. 
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As used herein, injected refers to the microinjection [use of a small 
syringe] of DNA into a cell. 

As used herein, substantially homologous DNA refers to DNA that 
includes a sequence of nucleotides that is sufficiently similar to another 
5 such sequence to form stable hybrids under specified conditions. 

It is well known to those of skill in this art, that nucleic acid 
fragments with different sequences may, under the same conditions, 
hybridize detectably to the same "target" nucleic acid. Two nucleic acid 
fragments hybridize detectably, under stringent conditions over a 
10 sufficiently long hybridization period, because one fragment contains a 
segment of at least about 14 nucleotides in a sequence which is 
complementary [or nearly complementary] to the sequence of at least one 
segment in the other nucleic acid fragment. If the time during which 
hybridization is allowed to occur is held constant, at a value during which, 
15 under preselected stringency conditions, two nucleic acid fragments with 
exactly complementary base-pairing segments hybridize detectably to 
each other, departures from exact complementarity can be introduced into 
the base-pairing segments, and base-pairing will nonetheless occur to an 
extent sufficient to make hybridization detectable. As the departure from 
20 complementarity between the base-pairing segments of two nucleic acids 
becomes larger, and as conditions of the hybridization become more 
stringent, the probability decreases that the two segments will hybridize 
detectably to each other. 

Two single-stranded nucleic acid segments have "substantially the 
25 same sequence," within the meaning of the present specification, if 

(a) both form a base-paired duplex with the same segment, and (b) the 
melting temperatures of said two duplexes in a solution of 0.5 X SSPE 
differ by less than 10°C. If the segments being compared have the same 
number of bases, then to have "substantially the same sequence", they 
30 will typically differ in their sequences at fewer than 1 base in 10. 
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Methods for determining melting temperatures of nucleic acid duplexes 
are well known [see, e.g. , Meinkoth and Wahl (1984) Anal. Biochem. 
138 :267-284 and references cited therein]. 

As used herein, a nucleic acid probe is a DNA or RNA fragment 
5 that includes a sufficient number of nucleotides to specifically hybridize to 
DNA or RNA that includes identical or closely related sequences of 
nucleotides. A probe may contain any number of nucleotides, from as 
few as about 10 and as many as hundreds of thousands of nucleotides. 
The conditions and protocols for such hybridization reactions are well 
10 known to those of skill in the art as are the effects of probe size, 

temperature, degree of mismatch, salt concentration and other parameters 
on the hybridization reaction. For example, the lower the temperature 
and higher the salt concentration at which the hybridization reaction is 
carried out, the greater the degree of mismatch that may be present in the 
15 hybrid molecules. 

To be used as an hybridization probe, the nucleic acid is generally 
rendered detectable by labelling it, with a detectable moiety or label, such 
as 32 P, 3 H and 14 C, or by other means, including chemical labelling, such 
as by nick-translation in the presence of deoxyuridylate biotinylated at the 
20 5'-position of the uracil moiety. The resulting probe includes the 

biotinylated uridylate in place of thymidylate residues and can be detected 
[via the biotin moieties] by any of a number of commercially available 
detection systems based on binding of streptavidin to the biotin. Such 
commercially available detection systems can be obtained, for example, 
25 from Enzo Biochemicals, Inc. [New York, NY]. Any other label known to 
those of skill in the art, including non-radioactive labels, may be used as 
long as it renders the probes sufficiently detectable, which is a function of 
the sensitivity of the assay, the time available [for culturing cells, 
extracting DNA, and hybridization assays], the quantity of DNA or RNA 
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available as a source of the probe, the particular label and the means used 

to detect the label. 

Once sequences with a sufficiently high degree of homology to the 

probe are identified, they can readily be isolated by standard techniques, 
5 which are described, for example, by Maniatis et aL ((1 982) Molecular 

Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory Press, Cold 

Spring Harbor, NY). 

As used herein, conditions under which DNA molecules form stable 

hybrids and are considered substantially homologous are such that the 
10 DNA molecules with at least about 60% complementarity form stable 

hybrids. Such DNA fragments are herein considered to be "substantially 

homologous". For example, DNA that encodes a particular protein is 

substantially homologous to an other DNA fragment if the DNA forms 

stable hybrids such that the sequences of the fragments are at least 
15 about 60% complementary and if a protein encoded by the DNA retains 

its activity. 

For purposes herein, the following stringency conditions are 
defined: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65°C 
20 2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 

3) low stringency: 1 .0 x SSPE, 0.1 % SDS, 50°C 
or any combination of salt and temperature and other reagents that result 
in selection of the same degree of mismatch or matching. 

As used herein, immunoprotective refers to the ability of a vaccine 
25 or exposure to an antigen or immunity-inducing agent to confer upon a 
host to whom the vaccine or antigen is administered or introduced the 
ability to resist infection by a disease causing pathogen or to have 
reduced symptoms. The selected antigen is typically an antigen that is 
presented by the pathogen. 
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As used herein, all assays and procedures, such as hybridization 
reactions and antibody-antigen reactions, unless otherwise specified, are 
conducted under conditions recognized by those of skill in the art as 
standard conditions. 

A. Preparation of cell lines containing MACs 

The methods, cells and MACs provided herein are produced by 
virtue of the discovery of the existence of a higher-order replication unit 
[megareplicon] of the centromeric region. This megareplicon is delimited 
by a primary replication initiation site [megareplicator], and appears to 
facilitate replication of the centromeric heterochromatin, and most likely, 
centromeres. Integration of heterologous DNA into the megareplicator 
region or in close proximity thereto, initiates a large-scale amplification of 
megabase-size chromosomal segments, which leads to de novo 
chromosome formation. 

Cell lines containing MACs can be prepared using cells, preferably a 
stable cell line, transforming it with a heterologous DNA fragment that 
encodes a selectable marker, culturing under selective conditions, and 
identifying cells that have a dicentric chromosome. These cells can then 
be manipulated as described herein to produce the minichromosomes and 
other MACs, particularly the heterochromatic SATACs as described 
herein. 

Development of a dicentric chromosome appears to require 
integration of the heterologous DNA in the pericentric heterochromatin. 
Thus, the probability of incorporation can be increased by including DNA, 
such as satellite DNA, in the heterologous fragment that encodes the 
selectable marker. The resulting cell lines can then be treated as the 
exemplified cells herein to produce cell in which the dicentric 
chromosome has fragmented and to introduce additional selective 
markers into the dicentric chromosome, whereby amplification of the 
pericentric heterochromatin will produce the heterochromatic 
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chromosomes. The following discussion is with reference to the EC7/3 
line and use of resulting cells. The same procedures can be applied and 
to any other cells, particularly cell lines to prepare create SATACs and 
euchromatic minichromosomes. 
5 1 . Formation of de novo chromosomes 

De novo centromere formation in a transformed mouse LMTK-fibro- 
blast cell line [EC3/7] after cointegration of A constructs WCM8 and 
/igtWESneo] carrying human and bacterial DNA [Hadlaczky et ah (1991) 
Proc. Natl. Acad. Sci. U.S.A. 88:81 06-81 1 0 and U.S. application Serial No. 
10 08/375,271] has been shown. The integration of the "heterologous" human 
and phage DNA, and the subsequent amplification of mouse and 
heterologous DNA that led to the formation of a dicentric chromosome, 
occurred at the centromeric region of the short arm of a mouse 
chromosome. By G-banding this chromosome was identified as mouse 
15 chromosome 7. Because of the presence of two functionally active 
centromeres on the same chromosome, regular breakages occur between the 
centromeres. Such specific chromosome breakages gave rise to the 
appearance [in approximately 10% of the cells] of a chromosome fragment 
carrying the neo-centromere. From the EC3/7 cell line [see, U.S. Patent No. 
20 5,288,625, deposited at the European Collection of Animal cell Culture 
(hereinafter ECACC) under accession no. 90051001]; see, also Hadlaczky 
eta] M9911 Proc. Natl. Acad. Sci. U.S.A. 88:81 06-81 1 0, and U.S. applica- 
tion Serial No. 08/375,271 and the corresponding published European 
application EP 0 473 253] carrying either the dicentric chromosome or a 
25 chromosome fragment with the neo-centromere, two sublines [EC3/7C5 and 
EC3/7C6] were selected by repeated single-cell cloning. In these cell lines, 
the neo-centromere was found exclusively on a minichromosome [neo- 
minichromosome]), while the formerly dicentric chromosome carried traces 
of "heterologous" DNA. 
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It has now been discovered that integration of DNA encoding a 
selectable marker in the heterochromatic region of the centromere led to 
formation of the dicentric chromosome. 
2. The neo-minichromosome 
5 The chromosome breakage in the EC3/7 cells, which separates the 

neo-centromere from the mouse chromosome, occurred in the G-band 
positive "heterologous" DNA region. This is supported by the observation of 
traces of A and human DNA sequences at the broken end of the formerly 
dicentric chromosome. Comparing the G-band pattern of the chromosome 
10 fragment carrying the neo-centromere with that of the stable neo- 
minichromosome, it is apparent that the neo-minichromosome is an inverted 
duplicate of the chromosome fragment that bears the neo-centromere. This 
is supported by the observation that although the neo-minichromosome 
carries only one functional centromere, both ends of the minichromosome 
15 are heterochromatic, and mouse satellite DNA sequences were found in 
these heterochromatic regions by in situ hybridization. 

Mouse cells containing the minichromosome, which is composed of 
multiple repeats of the heterologous DNA, which in the exemplified 
embodiment is lambda DNA and neo DNA, can be used as recipient cells in 
20 cell transformation. Donor DNA, such as selected heterologous DNA linked 
to a second selectable marker, such as hygromycin resistance [hyg], can be 
introduced into the mouse cells and integrated into the minichromosomes by 
homologous recombination of lambda DNA in the donor DNA with that in the 
minichromosomes. Integration is verified by in situ hybridization and 
25 Southern blot analyses. Transcription and translation of the heterologous 
DNA is confirmed by primer extension and immunoblot analyses. 

For example, DNA has been targeted into the A-neo minichromosome 
in EC3/7C5 cells using a lambda DNA-containing construct [pNeml rue] that 
also contains DNA encoding hygromycin resistance and the Renilla luciferase 
30 gene linked to a promoter, such as the cytomegalovirus [CMV] early 
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promoter, and the bacterial neo encoding DNA. Integration of the donor 
DNA into the chromosome in selected cells [designated PHN4] was 
confirmed by nucleic acid amplification [PCR] and in situ hybridization. 
Events that would produce a neo-minichromosome are depicted in Figure 1 . 

The resulting engineered minichromosome that contains the heterolo- 
gous DNA can then transferred by cell fusion into a recipient cell line, such 
as Chinese hamster kidney cells [CHO] and correct expression of the 
heterologous DNA can be verified. Following production of the cells, 
metaphase chromosomes are obtained, such as by addition of colchicine, 
and the chromosomes purified by addition of AT and GC specific dyes on a 
dual laser beam based cell sorter. Preparative amounts of chromosomes 
[2-3 mis of 1 0 6 cells/ml] at a purity of 95% or higher can be obtained. The 
resulting chromosomes are used for delivery to cells by methods, such as 
microinjection, liposome packaged transfer. 

Thus, the neo-minichromosome is stably maintained in cells, replicates 
autonomously, and permits the persistent long-term expression of neo gene 
under non-selective culture conditions. It also contains megabases of 
heterologous known DNA [lambda DNA in the exemplified embodiments] 
that serves as target sites form homologous recombination and integration 
of DNA of interest. The neo-minichromosome is, thus, a vector for genetic 
engineering of cells. 

The methods herein provide means to induce the events that led to 
formation of the neo-minichromosome by introducing heterologous DNA with 
a selective marker [preferably a dominant selectable marker] and culturing 
under selective conditions. As a result, cells that contain a dicentric 
chromosome or fragments thereof produced by amplification, will be 
produced. Cells with the dicentric chromosome can then be treated to 
destabilize the genome with agents, such as BrdU and/or culturing under 
selective conditions, resulting in cells in which the dicentric chromosome has 
formed two chromosomes, a so-called minichromosome, and a formerly 
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dicentric chromosome that has typically undergone amplification in the 
heterochromatin where the heterologous DNA has integrated to produce a 
generally a SATAC or a sausage chromosome [discussed below]. These 
cells can be fused with other cells to separate the minichromosome from the 
formerly dicentric chromosome into different cells so that each type of MAC 
can be manipulated separately. 

3. Preparation of SATACs 

An Exemplary protocol for preparation of SATACs is illustrated in 
Figure 2 [particularly C, D and F] and FIGURE 4 [see, also the EXAMPLES, 
particularly EXAMPLES 4-7]. 

To prepare a SATAC, the starting materials are a cell, preferably a 
stable cell line, such as a fibroblast cell line, and a DNA fragment that 
includes DNA that encodes a selective marker. To insure integration of the 
DNA fragment in the heterochromatin, it is preferable to start with DNA that 
will be targeted to the pericentric heterochromatic region of the 
chromosome, such as/\CM8 and vectors provided herein, such as pTEMPUD 
[Figure 5] that include satellite DNA. After introduction of the DNA, the 
cells are grown under selective conditions. The resulting cells are examined 
and any that have dicentric chromosomes [or heterochromatic chromosomes 
or sausage chromosomes or other such structure [see, Figure 2C, 2D, 2E 
and 2F] are selected. 

In particular, if a cell with a dicentric chromosome is selected, it can 
be grown under selective conditions, or, preferably, additional DNA encoding 
a second selectable marker is introduced, and the cells grown under 
conditions selective for the second marker. The resulting cells should 
include chromosomes that have structures similar to those depicted in 
Figures 2D, 2E, 2F. Cells with a structure, such as the sausage 
chromosome, Figure 2D, can be selected and fused with a second cell line 
to eliminate other chromosomes that are not of interest. If desired cells with 
other chromosomes can be selected and treated as described herein. If a 
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cell with a sausage chromosome is selected, it is treated with an agent, such 
as BrdU, that destabilizes the chromosome so that the heterochromatic arm 
forms a chromosome that is substantially heterochromatin 
[megachromosome, see, Figure 2F] . Structures such as the gigachromsome 
in which the heterochromatic arm has amplified but not broken off from the 
euchromatic arm, will also be observed. The megachromosome is a stable 
chromosome. Further manipulation, such as fusions and growth in selective 
conditions and/or BrdU treatment or other such treatment, can lead to 
fragmentation of the megachromosome to form smaller chromosomes that 
have the amplicon as the basic repeating unit. 

The megachromosome can be further fragmented in yjvo using a 
chromosome fragmentation vector, such as pTEMPUD [see, Figure 5 and 
EXAMPLE 12] to ultimately produce a chromosome that comprises the 
smallest stable replicable unit, about 15 Mb-50 Mb, containing two to four 
megareplicons. 

Thus,The stable chromosomes formed de novo that originate from the 
short arm of mouse chromosome 7 have been analyzed. This chromosome 
region shows a capacity for amplification of large chromosome segments, 
and promotes de novo chromosome formation. Large-scale amplification at 
the same chromosome region leads to the formation of dicentric and 
multicentric chromosomes, a minichromosome, the 1 50-200 Mb s\ze A neo- 
chromosome, the "sausage" chromosome, the 500-1000 Mb 
gigachromosome, and the stable 250-400 Mb megachromosome. 

A clear segmentation is observed along the arms of the 
megachromosome, and analyses show that the building units of this 
chromosome are amplicons of -30 Mb composed of mouse major satellite 
DNA with the integrated "foreign" DNA sequences at both ends. The -30 
Mb amplicons are composed of two ~ 1 5 Mb inverted doublets of -7.5 Mb 
mouse major satellite DNA blocks, which are separated from each other by 
a narrow band of non-satellite sequences [see, e^, Figure 3]. The wider 
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non-satellite regions at the amplicon borders contain integrated, exogenous 
[heterologous] DNA, while the narrow bands of non-satellite DNA sequences 
within the amplicons are integral parts of the pericentric heterochromatin of 
mouse chromosomes. These results indicate that the -7.5 Mb blocks 
flanked by non-satellite DNA are the building units of the pericentric 
heterochromatin of mouse chromosomes, and the -15 Mb size pericentric 
regions of mouse chromosomes contain two -7.5 Mb units. 

Apart from the euchromatic terminal segments, the whole 
megachromosome is heterochromatic, and has structural homogeneity. 
Therefore, this large chromosome offers a unique possibility for obtaining 
information about the amplification process, and for analyzing some basic 
characteristics of the pericentric constitutive heterochromatin, as vector for 
heterologous DNA, and as target for further fragmentation. 

As shown herein, this phenomenon is generalizable and can be 
observed with other chromosomes. Also, although these de novo formed 
chromosome segments and chromosomes appear different, there are, 
similarities that indicate that a similar amplification mechanism plays a role 
in their formation: (i) in each case, the amplification is initiated in the 
centromeric region of the mouse chromosomes and large (Mb size) 
amplicons are formed; (ii) mouse major satellite DNA sequences are constant 
constituents of the amplicons, either by providing the bulk of the 
heterochromatic amplicons [H-type amplification], or by bordering the 
euchromatic amplicons [E-type amplification]; (iii) formation of inverted 
segments can be demonstrated in the A neo-chromosome and 
megachromosome; (iv) chromosome arms and chromosomes formed by the 
amplification are stable and functional. 

The presence of inverted chromosome segments seems to be a 
common phenomenon in the chromosomes formed de novo at the 
centromeric region of mouse chromosome 7. During the formation of the 
neo-minichromosome, the event leading to the stabilization of the distal 
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segment of mouse chromosome 7 that bears the neo-centromere may have 
been the formation of its inverted duplicate. Amplicons of the 
megachromosome are inverted doublets of -7.5 Mb mouse major satellite 
DNA blocks. 

4. Cell lines 

Cell lines that contain MACs, such as the minichromosome, the A-neo 
chromosome, and the SATACs are provided herein or can be produced by 
the methods herein. Such cell lines provide a convenient source of these 
chromosomes and can be manipulated, such as by cell fusion or production 
of microcells for fusion with selected cell lines, to deliver the chromosome 
of interest into hybrid cell lines. Exemplary cell lines are described herein 
and some have been deposited with the ECACC. 

a. EC3/7C5 and EC3/7C6 

Two cell lines EC3/7C5 and EC3/7C6 produced by single cell cloning 
of EC3/7 were examined. For exemplary purposes EC3/7C5 has been 
deposited with the ECACC. These cell lines contain a minichromosome and 
the formerly dicentric chromosome from EC3/7. The stable mini- 
chromosomes in cell lines [EC3/7C5 and EC3/7C6] appear to be virtually 
identical and they seem to be a duplicated derivatives of the -10-15 Mb 
"broken-off" fragment of the dicentric chromosome. Their identical size in 
these independently generated cell lines might indicate that -20-30 Mb is 
the minimal or close to the minimal physical size for a stable 
minichromosome. 

b. TF1004G/19 

Introduction of additional heterologous DNA, including a second 
selectable marker hygromycin and also a detectable marker £-galactosidase, 
into the EC3/7C5 cell line and growth under selective conditions produced 
cells designated TF1 004G/1 9. In particular, this cell line was produced from 
the EC3/7C5 cell line by cotransfection with plasmids pH132, which 
contains an anti-HIV ribozyme, and hygromycin resistance gene, pCH110 
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[encodes £-galactosidase] and A phage Ucl 875 Sam 7] DNA and selection 
with hygromycin B. 

Detailed analysis of TF1 004G/1 9 cell line by in situ hybridization with 
lambda phage and plasmid DNA sequences revealed the formation of the 
sausage chromosome. The formerly dicentric chromosome of EC3/7C5 cell 
translocated to the end of another acrocentric chromosome. The 
heterologous DNA integrated into the pericentric heterochromatin of formerly 
dicentric chromosome and is amplified several times with megabases of 
mouse pericentric heterochromatic satellite DNA sequences [Fig. 2D]) 
forming the "sausage" chromosome. Subsequently the acrocentric mouse 
chromosome was substituted by a euchromatic telomere. 

In situ hybridization with biotin labeled subfragments of the 
hygromycin resistance and /?-galactosidase genes resulted in hybridization 
signal only in the heterochromatic arm of the sausage chromosome, 
indicating that in TF1 004G/1 9 transformant cells these genes are localized 
in the pericentric heterochromatin. A high level of gene expression, 
however, was detected. 

In general, heterochromatin has a silencing effect in Drosophila, yeast 
and on the HSV-tk gene introduced into satellite DNA at mouse centromere. 
Thus, it was of interest to study the TF1004G/19 transformant cell line in 
to confirm that gene expression [of the £-gal] was indeed localized in the 
heterochromatin contrary to recognized dogma. 

For this purpose, subclones of TF1004G/19 containing a different a 
sausage chromosome [see Figure 2D] by single cell cloning were estab- 
lished. Southern DNA hybridization with subfragments of hygromycin 
resistance and /?-galactosidase genes showed close correlation to the 
intensity of hybridization and the length of sausage chromosome. This 
finding supports the conclusion that these genes are localized in the 
heterochromatic arm of the sausage chromosome. 
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(1) TF1004G-19C5 

TF1004G-19C5 is a mouse LMTK" fibroblast cell line containing neo- 
minchromosomes and stable "sausage" chromosomes. It is a subclone of 
TF1 004G/1 9. It has been deposited as an exemplary cell line and exemplary 
source of a sausage chromosome. Subsequent fusion of this cell line with 
CHO K20 cells and selection with hygromycin and hat resulted in hybrid cells 
that carry the sausage chromosome and/or the neo-minichromosome. BrdU 
treatment, single cell cloning and selection with G418 and/or hygromycin 
produced various cells that carry chromosomes of interest, including G3D5. 

(2) other subclones 

G3D5, which has been deposited is a mouse hamster hybrid cell line 
that carries the neo-minichromosome and the megachromosome. H1 D3 is 
a subclone thereof that carries the megachromosome. Fusion of this cell line 
with the CD4 + Hela cell line and also DNA encoding an additional selection 
gene, neo, produced cells that carry the megachromosome as well as a 
human chromosome that carries CD4neo [H1D3 cells]. Further BrdU 
treatment and single cell cloning produced cell lines, such as 1 B3 that 
include cells with a truncated megachromosome. 

5. DNA constructs used to transform the cells 
Heterologous DNA can be introduced into the cells by transfection or 
other suitable method at any stage during preparation of the chromosomes 
[see, e.g. , FIG. 4]. In general integration of such DNA is assured by relying 
on site directed integration, such as by inclusion of /1-DNA for the 
exemplified chromosomes and also an additional selective marker gene. For 
example, cells with a MAC, such as the minichromosome or a SATAC can 
be cotransfected with a plasmid encoding the desired heterologous DNA, 
such as HIV ribozyme, cystic fibrosis gene, and a second selectable marker, 
such as hygromycin resistance. Selection is effected with the agent that 
selects for the new selectable marker cells containing chromosomes that 
include the DNA in the MAC are identified. Fusion with a second cell line 
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can provide a means to produce cell lines that contain one particular type of 
chromosomal structure or MAC. 

Various vectors for this purpose are provided herein [see, Examples] 
and others can be readily constructed. The vectors should include DNA that 
5 will target the DNA to the MAC, a selectable marker and the selected 
heterologous gene of interest. Based on the disclosure herein and the 
knowledge of the skilled artisan, one of skill can construct such vectors. 

Of particular interest herein is the vector pTEMPUD and derivatives 
thereof that can target DNA into the heterochromatic region of selected 
1 0 chromosomes. These vectors can also serve as fragmentation vectors [see, 

e.g. . Example 1 2]. 

Heterologous genes of interest include any gene that encodes a 
therapeutic gene and DNA encoding gene products of interest. These genes 
and DNA include, but are not limited to: the cystic fibrosis gene [CF] cystic 
15 fibrosis transmembrane regulator (CFTR) [see, e^, U.S. Patent No. 
5,240,846; Rosenfeld et aL (1992) Cell 68:143-155; Hyde et ah (1993) 
Nature 362 : 250-255; Kerem et aL ( 1 989) Science 245: 1 073- 1 080; Riordan 
et aL(1989) Science 245:1066-1072; Rommens et aL (1989) Science 
245:1059-1065; Osborne et aL (1991) Am. J. Hum. Genetics 
20 48:6089-61 22; whit* gt gl_ M 9901 Nature 344 :665-667; Dean et aL (1 990) 
Ceil 61:863-870; Erlich et aL (1 991 ) Science 252:1 643; and U.S. Patent 
Nos. 5,453,357, 5,449,604, 5,434,086, and 5,240,846, which provides 
a retroviral vector encoding the normal CFTR gene]. 
B. Isolation of artificial chromosomes 
25 The MACs provided herein can be isolated by any suitable method 

known to those of skill in the art. Also, a method is provided herein for 
effecting substantial purification. SATACs have been isolated by 
fluorescence activated cell sorting [FACS]. This method takes advantage of 
the nucleotide base content of the SATACs, which by virtue of their 
30 heterochromatic content will differ from any other chromosomes in a cell. 
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In particular, metaphase chromosomes are isolated and stained with base 
specific dyes, such as Hoechst 33258 and chromocycin A3. Fluorescence 
activated cell sorting will separate the SATACs from the genomic 
chromosomes. A dual-laser cell sorter [FACStar Plus and FAXStar Vantage 
Becton Dickinson Immunocytometry System) in which two lasers were set 
to excite the dyes separately, allowed a bivariate analysis of the 
chromosomes by size and base-pair composition. Cells containing such 
SATACs can be similarly sorted. 

C. Introduction of artificial chromosomes into cells, tissues, animals and 
plants 

Suitable hosts for introduction of the MACs provided herein, include 
but are not limited to any animal or plant, cell or tissue thereof, including,but 
not limited to: mammals, birds, reptiles, amphibians, insects, fish, arachnids, 
tobacco, tomato, wheat, monocots, dicots and algae. The MACs may be 
introduced by cell fusion or microcell fusion or subsequent to isolation by 
any method known to those of skill in this art, including but not limited to: 
direct DNA transfer, electroporation, lipofection, liposomes, microprojectile 
bombardment, microinjection and any other suitable method. 

Other methods for introducing DNA into cells, include nuclear 
microinjection, electroporation, bacterial protoplast fusion with intact cells. 
Polycations, such as polybrene and polyornithine, may also be used. For 
various techniques for transforming mammalian cells, see e^, Keown et aL 
Methods in Enzvmoloav (1990) Vol. 185, pp. 527-537; and Mansour et ah 
(1988) Nature 336 :348-352. 

DNA may be introduced by direct DNA transformation; microinjection 
in cells or embryos, protoplast regeneration for plants, electroporation, 
microprojectile gun and other such methods [see, e^, Weissbach et aL 
(1 988) Methods for Plant Molecular Biology, Academic Press, N.Y., Section 
VIII, pp. 421-463; Grierson et aL (1988) Plant Molecular Biology, 2d Ed., 
Blackie, London, Ch. 7-9; see, also U.S. Patent Nos. 5,491,075; 
5,482,928; and 5,424,409; see, also, e^, U.S. Patent No. 5,470,708, 
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which describes particle-mediated transformation of mammalian unattached 
cells]. 

For example, isolated purified artificial chromosomes can be injected 
into an embryonic cell line such as a human kidney primary embryonic cell 
5 line [ATCC CRL 1573] or embryonic stem cells [see, e^, Hogan et aL 
(1 994) Manipulating the Mouse Embryo, A .Laboratory Manual, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, see, especially, pages 
255-264 and Appendix 3]. Preferably the chromosomes are introduced by 
microinjection, using a system such as the Eppendorf automated 
10 microinjection system, and grown under selective conditions, such as 
hygromycin B or neomycin resistance. 

1 . Methods for introduction of chromosomes into hosts 
Depending on the host cell used, transformation is done using 
standard techniques appropriate to such cells. These methods include any, 
15 including those described herein, known to those of skill in the art. 
a. DNA uptake 
For mammalian cells without such cell walls, the calcium phosphate 
precipitation method [see, e^, Graham et aL_ d 978) Virology 52:456-457 
is often preferred. DNA uptake can be accomplished by DNA alone or in the 
20 presence of polyethylene glycol [PEG-mediated gene transfer], which is a 
fusion agent, with plant protoplasts or by any variations of such methods 
known to those of skill in the art [see, et al U.S. Pat. No. 4,684,61 1]. 

A commonly used approach for gene transfer in land plants involves the 
direct introduction of purified DNA into protoplasts. The three basic 
25 methods for direct gene transfer include: 1) polyethylene glycol 
[PEG]-mediated DNA uptake, 2) electroporation -mediated DNA uptake and 
3) microinjection. In addition, plants may be transformed using ultrasound 
treatment [see, e.g. . International PCT application No. WO 91/00358]. 
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b. Electroporation 

Electroporation, which involves providing high-voltage electrical 
pulses to a solution containing a mixture of protoplasts and foreign DNA to 
create reversible pores in the membranes of plant protoplasts as well as 
other cells. Electroporation is generally used for prokaryotes or other cells, 
such as plants that contain substantial cell-wall barriers. Methods for 
effecting electroporation are well known [see, e.g., U.S. Patent Nos. 
4,784,737, 5,501,967, 5,501,662, 5,019,034, 5,503,999; see, also 
Frommet aL (1985) Pmr.. Natl. Acad. Sci. U.S.A. 82:5824-5828]. 

For example, electroporation is often used for transformation of plants 
[see, e^, Ag Biotechnology News, Vol. 7 p. 3 and 17 (September/October 
1990)]. In this technique, plant protoplasts are electroporated in the 
presence of the DNA of interest that also includes a phenotypic marker. 
Electrical impulses of high field strength reversibly permeabilize 
biomembranes allowing the introduction of the plasmids. Electroporated 
plant protoplasts reform the cell wall, divide, and form plant callus. 
Transformed plant cells will be identified by virtue of the expressed 
phenotypic marker. The exogenous DNA may be added to the protoplasts in 
any form such as, for example, naked linear, circular or supercoiled DNA, 
DNA encapsulated in liposomes, DNA in spheroplasts, DNA in other plant 
protoplasts, DNA complexed with salts, and other methods, 
c. Microcells 

The chromosomes can be transferred by preparing microcells 
containing an artificial chromosome and then fusing with selected target 
cells. Methods for such preparation and fusion or microcells are well known 
[see, <u^, U.S. Patent Nos. 5,240,840, 4,806,476, 5,298,429, Fournier 
(1981) Proc. Natl. Acad. Sci. U.S.A. 78:6349-6353; and Lambert et aL 
(1991) Proc. Natl. Ar.ad. Sci. U.S.A. 88:5907-59]. 
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2. Hosts 

Suitable host include any host known to be useful for introduction and 

expression of heterologous DNA. Of particular interest herein, animal and 

plant cells and tissues, including, but not limited to insect cells and larvae, 

plants, and animals, particularly transgenic animals, and animal cells. Other 

hosts include, but are not limited to mammals, birds, reptiles, amphibians, 

insects, fish, arachnids, tobacco, tomato, wheat, monocots, dicots and 

algae, and any host into which introduction of heterologous DNA is desired. 

Such introduction can be effected using the MACs provided herein, or, if 

necessary by using the MACs provided herein to identify species-specific 

centromeres and/or functional chromosomal units and then using the 

resulting centromeres or chromosomal units as artificial chromosomes, or 

alternatively, using the methods exemplified herein for production of MACs 

to produce species-specific artificial chromosomes. 

a. Introduction of DNA into embryos for production of 
transgenic animals and introduction of DNA into animal 
cells 

Transgenic animals can be produced by introducing exogenous genetic 
material into a pronucleus of a mammalian zygote by microinjection [see, 
e.g. , U.S. Patent Nos. 4,873,191 and 5,354,674; see, also, International 
PCT application No. W095/1 4769, which is based on U.S. application Serial 
No. 08/159,084]. The zygote is capable of development into a mammal. 
The embryo or zygote is transplanted into a host female uterus and allowed 
to develop. Detailed protocols and examples are set forth below. 

DNA can be introduced into animal cells using any known procedure, 
including, but not limited to: direct uptake, incubation with polyethylene 
glycol [PEG], microinjection, electroporation, lipofection, cell fusion, microcell 
fusion, particle bombardment, including microprojectile bombardment [see, 
e.g. , U.S. Patent No. 5,470,708, which provides a method for transforming 
unattached mammalian cells via particle bombardment], and any other such 
method. For example, the transfer of plasmid DNA in liposomes directly to 
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human cells in situ has been approved by the FDA for use in humans [see, 
e.g. , Nabel, et aL (1990) Science 249:1285-1288 and U.S. Patent No. 
5,461,032]. 

b. Introduction of heterologous DNA into plants. 

5 Numerous methods for producing or developing transgenic plants are 

available to those of skill in the art. The method used is primarily a function 
of the species of plant. These methods include, but are not limited to: direct 
transfer of DNA by processes, such as PEG-induced DNA uptake, protoplast 
fusion, microinjection, electroporation, and microprojectile bombardment 
10 [see, e^cu, Uchimiya et aL (1 989) J. of Biotech. 1 2: 1 -20 for a review of 
such procedures, see, also, e^u, U.S. Patent Nos. 5,436,392 and 
5,489,520 and many others]. For purposes herein, when introducing a 
MAC, microinjection and protoplast fusion are preferred 

Plant species, including tobacco, rice, maize, rye, soybean, Brassica 
15 nanus , cotton, lettuce, potato and tomato, have been used to produce 
transgenic plants. Tobacco and other species, such as petunias, often serve 
as experimental models in which the methods have been developed and the 
genes first introduced and expressed. 

DNA uptake can be accomplished by DNA alone or in the presence of 
20 PEG, which is a fusion agent, with plant protoplasts or by any variations of 
such methods known to those of skill in the art [see, e^, U.S. Patent No. 
4,684,611 to Schilperoot et aL]. Electroporation, which involves high- 
voltage electrical pulses to a solution containing a mixture of protoplasts and 
foreign DNA to create reversible pores, has been used, for example, to 
25 successfully introduce foreign genes into rice and Brassica napus . 
Microinjection of DNA into plant cells, including cultured cells and cells in 
intact plant organs and embryoids in tissue culture and microprojectile 
bombardment [acceleration of small high density particles, which contain the 
DNA, to high velocity with a particle gun apparatus, which forces the 
30 particles to penetrate plant cell walls and membranes] have also been used. 
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All plant cells into which DNA can be introduced and that can be 
regenerated from the transformed cells can be used to produce transformed 
whole plants which contain the transferred chromosome. The particular 
protocol and means for introduction of the DNA into the plant host may need 
5 to be adapted or refined to suit the particular plant species or cultivar. c . 
Insect cells 

Insects are useful hosts for introduction of artificial chromosomes for 
numerous reasons, including, but not limited to: (a) amplification of genes 
encoding useful proteins can be accomplished in the artificial mammalian 

10 chromosome to obtain higher protein yields in insect cells; (b) insect cells 
support required posttranslational modifications, such as glycosylation and 
phosphorylation, that can be required for protein biological functioning; (c) 
insect cells do not support mammalian viruses, and, thus, eliminate the 
problem of cross-contamination of products with such infectious agents; (d) 

1 5 this technology circumvents traditional recombinant baculovirus systems for 
production of nutritional, industrial or medicinal proteins in insect cell 
systems; (e) the low temperature optimum for insect cell growth (28° C) 
permits reduced energy cost of production; (f) serum free growth medium 
for insect cells permits lower production costs; (g) artificial chromosome- 

20 containing cells can be stored indefinitely at low temperature; and (h) insect 
larvae will be biological factories for production of nutritional, medicinal or 
industrial proteins by microinjection of fertilized insect eggs [see, e.g. , Joy 
et aL (1991) Current Science 66:145-150, which provides a method for 
microinjecting heterologous DNA into Bombyx mori eggs]. 

25 Either MACs or insect-specific [BUGACs] will used to introduce genes 

into insects. As described in the Examples, it appears that MACs will 
function in insects to direct expression of heterologous DNA contained 
thereon. 

Insect host cells include, but are not limited to, hosts such as 
30 Spodoptera frugiperda [caterpillar], Aedes aegypti [mosquito], Aedes 
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albopictus [mosquito], Drosphila me/anogaster [fruitfly], Bombyx mori 
[silkworm] and Trichoplusia ni [cabbage looper] . Effort haves been directed 
toward propagation of insect cells in culture. Such efforts have focused on 
the fall armyworm, Spodoptera frugiperda. The cell lines have been 
5 developed also from other insects such as the cabbage looper, trichoplusia 
ni and the silkworm, Bombyx mori. It has also been suggested that 
analogous cell lines can be created using the tobacco hornworm, or 
Manduca sexta. To introduce DNA into an insect, it should be introduced 
into the larvae, and allowed to proliferate, and then the hemolymph 
10 recovered from the larvae so that the proteins can be isolated therefrom. 
The preferred method herein is microinjection [see, e^, Tamura et aL 
(1991) Bio Ind. 8:26-31; Nikolaev et al (1989) Mol. Biol. (Moscow) 
23;1 177-87; and methods exemplified and discussed herein]. 
D. Applications for and Uses of Artificial chromosomes 
1 5 Artificial chromosomes provide convenient and useful vectors, and in 

some instances the only vectors for introduction of heterologous genes into 
hosts. Virtually any gene of interest is amenable to introduction into a host 
via an artificial chromosomes. Such genes include, but are not limited to 
genes that encode receptors, cytokines, enzymes, proteases, hormones, 
20 growth factors, tumor suppressor genes, therapeutic products and multigene 
pathways. 

The artificial chromosomes provided herein will be used in methods 
of protein and gene product production, particularly using insects as host 
cells for production of such products. They are also intended for use in 
25 methods of gene therapy, and in for production of transgenic plants and 
animals [discussed above, below and in the EXAMPLES]. 
1 . Gene Therapy 

Any therapeutic gene product or product of a multigene pathway may 
be introduced into a host animal, such as a human, or into a target cell line 
30 for introduction into an animal, for therapeutic purposes. Such therapeutic 
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purposes include, genetic therapy to cure or to provide gene products that 
are missing or defective, to deliver agents, such as anti-tumor agents, to 
targeted cells or to an animal, and to provide gene products that will confer 
resistance or reduce susceptibility to a pathogen or ameliorate symptoms of 
a disease or disorder. The following are some exemplary genes and gene 
products. Such exemplification is not intended to be limiting. 

a. Anti-HIV ribozymes 

As exemplified below, DNA encoding anti-HIV ribozymes can be 
introduced and expressed using MACs, including the euchromatin-based 
minichromosomes and the SATACs. These MACs can be used to 

make a transgenic mouse with that express a ribozyme and, thus, serve as 
a model for testing the activity of such ribozymes or from which ribozyme- 
producing cell lines can be made. Also, introduction of a MAC into human 
cells that encodes an anti-HIV ribozyme will serve as treatment for HIV 
infection. 

b. Tumor Suppressor Genes 

Tumor suppressor genes are genes that, in their wild-type alleles, 
express proteins that suppress abnormal cellular proliferation. When the gene 
coding for a tumor suppressor protein is mutated or deleted, the resulting 
mutant protein or the complete lack of tumor suppressor protein expression 
may fail to correctly regulate cellular proliferation, and abnormal cellular 
proliferation may take place, particularly if there is already existing damage 
to the cellular regulatory mechanism. A number of well-studied human 
tumors and tumor cell lines have been shown to have missing or 
nonfunctional tumor suppressor genes. 

Examples of tumor suppression genes include, but are not limited to, 
the retinoblastoma susceptibility gene or RB gene, the p53 gene, the deleted 
in colon carcinoma [DCC] gene and the neurofibromatosis type 1 [NF-1] 
tumor suppressor gene [see, e.g., U.S. Patent No. 5,496,731 ; Weinberg et 
al. (1991) 254 :1 138-1 146]. Loss of function or inactivation of tumor 
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suppressor genes may play a central role in the initiation and/or progression 
of a significant number of human cancers. 
The p53 Gene 

Somatic cell mutations of the p53 gene are said to be the most frequently 
5 mutated gene in human cancer [see, e^, Weinberg et aL (1 991 ) Science 
254 :1 138-1 146]. The normal or wild-type p53 gene is a negative regulator 
of cell growth, which, when damaged, favors cell transformation. The p53 
expression product is found in the nucleus, where it may act in parallel with 
or cooperatively with other gene products. Tumor cell lines in which p53 has 
10 been deleted have been successfully treated with wild-type p53 vector to 
reduce tumorigenicity [see, Baker et al (1 990) Science 249:91 2-91 5]. 

DNA encoding the p53 gene and plasmids containing this DNA are 
well known [see, e^, U.S. Patent No. 5,260,191; see, also Chen et aL 
(1990) Science 250:1576; Farrel et aL (1991) EMBO J, 10:2879-2887, 
1 5 plasmids containing the gene are available from the ATCC, and the sequence 
is in the GenBank Database, accession nos. X54156, X60020, M14695, 
M 16494, K03199]. 

c. The CFTR gene 
Cystic fibrosis [CF] is an autosomal recessive disease that affects 
20 epithelia of the airways, sweat glands, pancreas, and other organs. It is a 
lethal genetic disease associated with a defect in CI transport, and is caused 
by mutations in the gene coding for cystic fibrosis transmembrane 
conductance regulator [CFTR], a 1480 amino acid protein that has been 
associated with the expression of chloride conductance in a variety of 
25 eukaryotic cell types. Defects in CFTR destroy or reduce the ability of 
epithelial cells in the airways, sweat glands, pancreas and other tissues to 
secret CI in response to cAMP-mediated agonists and impair activation of 
apical membrane channels by cAMP-dependent protein kinase A [PKA]. 
Given the high incidence and devastating nature of this disease, 
30 development of effective CF treatments is imperative. 
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The CFTR gene [ ~ 600 kb] can be transferred, such as from a CF-YAC 
[see Green et al. Science 250 :94-98] by construction of a selectable CF YAC 
by inserting a selectable marker, such as puromycin or hygromycin 
resistance and M-DNA by site specific integration into the neo- 
5 minichromosome or into a SATAC. The CF-YAC can be introduced into 
cells, such as EC3/7C5 or 19C5xHa4 by yeast protoplast fusion or 
microinjection of yeast nuclei into mammalian cells, select stable 
transformants, and establish antibiotic-resistant cell lines. 

2. Disease resistant animals and plants 

10 Artificial chromosomes are ideally suited for preparing disease 

resistant animals, including vertebrates and invertebrates, including fish as 
well as mammals. In particular, multivalent vaccines can be prepared. 
Such vaccines will be encoded by multiple antigens that can be included in 
a MAC and either delivered to a host to induce immunity, or incorporated 

15 into embryos to produce disease-resistant animals and plants [or plants and 
animals that are less susceptible]. 

Fish and crustaceans will serve as model hosts for production of 
disease resistant animals. 

3. Use of MACs and other artificial chromosomes for preparation 
20 of and screening of libraries 

Since large fragments can incorporated into each SATAC, entire 

genomes can be readily screened. For example, DNA encoding tree growth 

factors can be introduced into trees. Libraries can be prepared, introduce 

large fragments into chromosomes, and introduce them all into trees, thereby 

25 insuring expression. 

The following examples are included for illustrative purposes only and 

are not intended to limit the scope of the invention. 

EXAMPLE 1 

General Materials and Methods 

30 The following materials and methods are exemplary of methods that 

are used in the following Examples and that can be used to pre. Other 
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suitable materials and methods known to those of skill in the art may used. 
Modifications of these materials methods known to those of skill in the art 

may also be employed. 

A. Culture of cell lines, cell fusion, and transfection of cells 

5 1. Chinese hamster K20 cells and mouse A9 fibroblast cells 

were cultured in F-1 2 medium. EC3/7 [see, U.S. Patent No. 5,288,625, and 

deposited at the European Collection of Animal cell Culture (ECACC) under 

accession no. 90051001 ; see, also Hadlaczky et ak (1 991 ) Proc. Natl. Acad. 

Sci. U.S.A. 88:8106-81 10 and U.S. application Serial No. 08/375,271] and 

10 EC3/7C5 [see, U.S. Patent No. 5,288,625 and Praznovszky et aL (1991) 

Proc. Natl. Ar.ad. Sci. U.S.A. 88:11042-11046] mouse cell lines, and the 

KE1-2/4 hybrid cell line were maintained in F-1 2 medium containing 400 

yc/g/ml G-418 [SIGMA, St. Louis, MO]. 

2. TF1004G19 and TF1004G19C5 mouse cells, described 
15 below, and the 19xHa4 hybrid, described below, and its sublines were 

cultured in F-1 2 medium containing 400£/g/ml Hygromycin B [Calbiochem]. 
LP11 cells were maintained in F-1 2 medium containing 3-15 //g/ml 
Puromycin [SIGMA, St. Louis, MO]. 

3. Cotransfection of EC3/7C5 cells with plasmids [pH1 32, 
20 pCH1 1 0 available from Pharmacia, see, also Hall et ak ( 1 983) J. Mol. Appk 

Gen. 2:101-109] and with A DNA was made using the calcium phosphate 
DNA precipitation method [see, e^, Chen et ak (1987) Mol. Cell. Biol. 
7:2745-2752], using 2-5 jjq plasmid DNA and 20 jjg A phage DNA per 5 x 
10 6 recipient cells. 

25 4. Cell fusion 

Mouse and hamster cells were fused using polyethylene glycol 
[Davidson et ak (1976) Som. Cell Genet . 2:165-176). Hybrid cells were 
selected in HAT medium containing 400 //g/ml Hygromycin B. 

Approximately 2x1 0 7 recipient and 2x1 0 6 donor cells were fused using 

30 polyethylene glycol [Davidson et ak (1976) Som. Cell Genet. 2:165-176]. 
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Hybrids were selected and maintained in F-1 2/HAT medium [Szybalsky et aL 
(1962) Natl. Cancer Inst. Monogr. 7:75-89] containing 10% FCS and 400 
jug/ml G418. The presence of "parental" chromosomes in the hybrids cell 
lines was verified by in situ hybridizations with species-specific probes using 
biotin labeled human and hamster genomic DNA, and a mouse long 
interspersed repetitive DNA [pMCPEI .51 ]. 
5. Microcell fusion 
Microcell-mediated chromosome transfer was done according to 
Saxon etaL [(1985) Mol. Cell. Biol. 1:140-146] with the modifications of 
Goodfellow et aL [(1989) Techniques for mammalian genome transfer. 
In Genome Analysis a Practical Approach. K.E. Davies, ed., IRL Press, 
Oxford, Washington DC. pp. 1-1 7] and Yamada et aL 1(1990) Oncoqene 
5:1 141-1 147]. Briefly, 5 x 10 6 EC3/7C5 cells in a T25 flask were treated 
first with 0.05/yg/ml colcemid for 48 hr and then with 1 0 j/g/ml cytochalasin 
B for 30 min. The T25 flasks were centrifuged on edge and the pelleted 
microcells were suspended in serum free DME medium. The microcells were 
filtered through first a 5 micron and then a 3 micron polycarbonate filter, 
treated with 50 //g/ml of phytohemagglutin, and used for polyethylene glycol 
mediated fusion with recipient cells. Selection of cells containing the 
MMCneo was started 48 hours after fusion in medium containing 400-800 

//g/ml G418.6 

B. Chromosome banding 

Trypsin G-banding of chromosomes was performed using the method 
of Wang & Fedoroff [(1972) Nature 235:52-54], and the detection of 
constitutive heterochromatin with the BSG. C-banding method was done 
according to Sumner [(1972) Cell Res. 75:304-306]. For the detection of 
chromosome replication by bromodeoxyuridine [BrdU] incorporation, the 
Fluorescein Plus Giemsa [FPG] staining method of Perry & Wolff [(1974) 
Nature 251 :156-158] was used. 

C. Immunolabelling of chromosomes and in situ hybridization 
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Indirect immunofluorescence labelling with human anti-centromere 
serum LU851 [Hadlaczky fit aL (1 986) Exp. Cell Res. 167:1-1 5, and indirect 
immunofluorescence and in situ hybridization on the same preparation were 
performed as described previously [see, Hadlaczky et aL (1 991 ) Proc. Natl. 
5 Acad. Sci. U.S.A. 88:8106-8110, see, also U.S. application Serial No. 
08/375,271]. Immunolabelling with fluorescein-conjugated anti-BrdU 
monoclonal antibody [Boehringer] was performed accordingto the procedure 
recommended by the manufacturer, except that for treatment of mouse A9 
chromosomes, 2 M hydrochloric acid was used at 37° C for 25 min, and for 
10 chromosomes of hybrid cells, 1 M hydrochloric acid was used at 37° C for 
30 min. 

D. Scanning electron microscopy 

Preparation of mitotic chromosomes for scanning electron microscopy 
using osmium impregnation was performed as described previously [Sumner 
15 (1 991 ) Chromosoma 100:41 0-41 8]. The chromosomes were observed with 
a Hitachi S-800 field emission scanning electron microscope operated with 
an accelerating voltage of 25 kV. 

E. DNA manipulations, plasmids and probes 
1 . General methods 

20 All general DNA manipulations were performed by standard 

procedures [see, e^, Sambrook et aL (1989) Molecular cloning: A 
Laboratory Manual Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY]. The mouse major satellite probe was provided by Dr. J. B. 
Rattner [University of Calgary, Alberta, Canada]. Cloned mouse satellite 

25 DNA probes [see, Wong et aL (1 988) Nucl. Acids Res. 16:11645-11661], 
including the mouse major satellite probe, were gifts from Dr. J. B. Rattner, 
University of Calgary. Hamster chromosome painting was done with total 
hamster genomic DNA, and a cloned repetitive sequence specific to the 
centromeric region of the chromosome 2 [Fatyol et aL (1994) Nucl. Acids 
30 Res. 22:3728-3736] was also used. Mouse chromosome painting was done 
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with a cloned long interspersed repetitive sequence [pMCP1 .51 ] specific for 
the mouse euchromatin. 

For cotransfection and for in situ hybridization, the pCH110 /?- 
galactosidase construct [Pharmacia or Invitrogen], and >»cl 875 Sam7 phage 
5 DNA [New England Biolabs] were used. 

2. Construction of Plasmid pPuroTel 
Plasmid pPuroTel, which carries a Puromycin resistance gene and a 
cloned 2.5 kb human telomeric sequence [see, SEQ ID No 3], was 
constructed from the pBabe-puro retroviral vector [Morgenstern et aL ( 1 990) 
10 NucL Acids Res. 18:3587-3596; provided by Dr. L. Szekely (Microbiology 
and Tumorbiology Center, Karolinska Instituted Stockholm); see, also 
Tonghua et ak d 995) Chin. Med. J. (Beijing, Engl. Ed.) 108:653-659; Couto 
et aL (1994) Infect. Immun. 62:2375-2378; Dunckley et aL (1992) FEBS 
Lett. 296:128-34; Fr^"^ gl^L Anal. Biochem. 228:354-355; Liu et 

15 aL (1995) Blood 85:1095-1103; International PCT application Nos. WO 
9520044; WO 9500178, and WO 9419456]. 
F. Deposited cell lines 

Cell lines KE1 2/4, EC3/7C5, TF1004G19C5, 1 9C5xHa4, G3D5 and 
H1D3 and have been deposited in accord with the Budapest Treaty at the 
20 European Collection of Animal cell Culture (ECACC) under Accession Nos. 
96040924, 96040925, 96040926, 96040927, 96040928 and 96040929, 
respectively. 

EXAMPLE 2 

Preparation EC3/7, EC3/7C5 and related cell lines 

25 The EC3/7 cell line is an LMTK mouse cell line that contains the neo- 

centromere. The EC3/7C5 cell line is a single-cell subclone of EC3/7 that 
contains the neo-minichromosome. 



-44- 



6869-402A 



A. EC3/7 Cell line 

As described in U.S. Patent No. 5,288,625 [see, also Praznovszky et 
aN (1 991 ) Proc. Natl. Acad. Sci. U.S.A. 88: 1 1 042-1 1 046 and Hadlaczky et 
aL ( 1 99 1 ) Proc. Natl. Acad. Sci. U.S.A. 88:8106-8110] de novo centromere 
5 formation occurs in a transformed mouse LMTK-fibroblast cell line [EC3/7] 
after cointegration of A constructs [ACM8 and ^gtWESneo] carrying human 
and bacterial DNA. 

By cotransfection of a 14 kb human DNA fragment cloned in lambda 
MCM8] and a dominant marker gene MgtWESneo], a selectable centromere 

10 linked to a dominant marker gene [neo-centromere] was formed in mouse 
LMTK "cell line EC3/7 [Hadlaczky et aL (1 991 ) Proc. Natl. Acad. Sci. U.S.A. 
88:8106-81 10, see Figure 1] Integration of the heterologous DNA [the A 
DNA and marker gene-encoding DNA] occurred into the short arm of an 
acrocentric chromosome [chromosome 7 (see, Figure 1B)], where an 

15 amplification process resulted in the formation of the new centromere 
[neo-centromere (see Figure 1 C)]. On the dicentric chromosome (Figure 1 C) 
the newly formed centromere region contains all the heterologous DNA 
(human, lambda, and neo) introduced into the cell, and an active centromere. 

20 Having two functionally active centromeres on the same chromosome 

causes regular breakages between the centromeres [see, Figure 1E]. The 
distance between the two centromeres on the dicentric chromosome is 
estimated to be —10-15 Mb, and the breakage that separates the 
minichromosome occurred between the two centromeres. Such specific 

25 chromosome breakages result in the appearance [in approximately 10% of 
the cells] of a chromosome fragment that carries the neo-centromere [Figure 
1 F]. This chromosome fragment is principally composed of human, lambda, 
plasmid, and neo gene DNA, but it also has some mouse chromosomal DNA. 
Cytological evidence suggests that during the stabilization of the MMCneo, 

30 there was an inverted duplication of the chromosome fragment bearing the 
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neo-centromere. The size of minichromosomes in both cell lines is 
approximately 20-30 Mb; this finding indicates a two-fold increase in size. 

From the EC3/7 cell line, which contains the dicentric chromosome 
[Figure 1 E], two sublines [EC3/7C5 and EC3/7C6] were selected by repeated 
5 single-cell cloning. In these cell lines, the neo-centromere was found 
exclusively on a small chromosome [neo-minichromosome], while the 
formerly dicentric chromosome carried detectable amounts of the 
exogenously-derived DNA sequences but not an active neo-centromere 
[Figure 1F and 1G]. 

10 The minichromosomes of cell lines EC3/7C5 and EC3/7C6 are similar. 

No differences are detected in their architectures at either the cytological or 
molecular level. The minichromosomes were indistinguishable by 
conventional restriction endonuclease mapping or by long-range mapping 
using pulsed field electrophoresis and Southern hybridization. The 

1 5 cytoskeleton of cells of the EC3/7C6 line showed an increased sensitivity to 
colchicine, so the EC3/7C5 line was used for further detailed analysis. 
B. Preparation of the EC3/7C5 and EC3/7C6 cell lines 
Integration of the "foreign" DNA, and subsequent amplification of 
mouse and "foreign" DNA that leads to the formation of a dicentric 

20 chromosome, occurs at the centromeric region of the short arm of a mouse 
chromosome, identified by G-banding as mouse chromosome 7. Because of 
the presence of two functionally active centromeres on the same 
chromosome, regular breakages occur between the centromeres. Such 
specific chromosome breakages give rise to the appearance [in 

25 approximately 10% of the cells] of a chromosome fragment carrying the 
neo-centromere and the remaining portion of he formerly dicentric 
chromosome. From the EC3/7 cell line carrying either the dicentric 
chromosome or a chromosome fragment with the neo-centromere, two 
sublines [EC3/7C5 and EC3/7C6] were selected by repeated single-cell 

30 cloning. In these cell lines, the neo-centromere was found exclusively on a 
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minichromosome (neo-minichromosome), while the formerly dicentric 
chromosome carried traces of "foreign" DNA sequences. 

The EC37/C5 cells, which contain the dicentric chromosome, were 
produced by subcloning the EC3/7 cell line in high concentrations of G41 8 
5 [40-fold the lethal dose] for 350 generations. Two single cell-derived stable 
cell lines [EC3/7C5 and EC3/7C6] were established. These cell lines carry 
the human centromere on minichromosomes and also contain the remaining 
fragment of the dicentric chromosome. Indirect immunofluorescence with 
anti-centromere antibodies and subsequent in situ hybridization experiments 

10 demonstrated that the minichromosomes derived from the dicentric 
chromosome. In EC3/7C5 and EC3/7C6 cell lines (140 and 128 
metaphases, respectively) no dicentric chromosomes were found and 
minichromosomes were detected in 97.2% and 98.1% of the cells, 
respectively. The minichromosomes have been maintained for over 1 50 cell 

1 5 generations. They do contain the remaining portion of the formerly dicentric 
chromosome. 

Multiple copies of telomeric DNA sequences were detected in the 
marker centromeric region of the dicentric chromosome by in situ hybridiza- 
tion. This indicates that mouse telomeric sequences were coamplified with 

20 the foreign DNA sequences. These stable minichromosome-carrying cell 
lines provide direct evidence that the extra centromere containing human 
DNA is functioning and is capable of maintaining the minichromosomes [see, 
U.S. Patent No. 5,288,625]. 

The chromosome breakage in the EC3/7 cells, which separates the 

25 neo-centromere from the mouse chromosome, occurred in the G-band 
positive "foreign" DNA region. This is supported by the observation of 
traces of A and human DNA sequences at the broken end of the formerly 
dicentric chromosome. Comparing the G-band pattern of the chromosome 
fragment carrying the neo-centromere with that of the stable neo- 

30 minichromosome, reveals that the neo-minichromosome is an inverted 
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duplicate of the chromosome fragment that bears the neo-centromere. This 
is also evidenced by the observation that although the neo-minichromosome 
carries only one functional centromere, both ends of the minichromosome 
are heterochromatic, and mouse satellite DNA sequences were found in 
5 these heterochromatic regions by in situ hybridization. 

These two cell lines EC3/7C5 and EC3/7C6, thus, carry a selectable 
mammalian minichromosome [MMCneo] with a centromere linked to a 
dominant marker gene Hadlaczky et aL (1 991 ) Proc. Natl. Acad. Sci. U.S.A. 
88:8106-8110]. MMCneo is intended to be used as a vector for 

10 minichromosome-mediated gene transfer and has been used as model of a 
minichromosome-based vector system. 

Long range mapping studies of the MMCneo indicated that human 
DNA and the neo gene constructs integrated into the mouse chromosome 
separately, followed by the amplification of the chromosome region that 

1 5 contains the exogenous DNA. The MMCneo contains about 30-50 copies of 
the ACM8 and /IgtWESneo DNA in the form of approximately 160 kb 
repeated blocks, which together cover at least a 3.5 Mb region. In addition 
to these, there are mouse telomeric sequences [Praznovszky et aT (1991) 
Proc. Natl. Acad. Sci. U.S.A. 88:11042-11046] and any DNA of mouse 

20 origin necessary for the correct higher-ordered structural organization of 
chromatids. 

Using a chromosome painting probe mCPE1.51 [mouse long 
interspersed repeated DNA], which recognizes exclusively euchromatic 
mouse DNA, detectable amounts of interspersed repeat sequences were 

25 found on the MMCneo by in situ hybridization. The neo-centromere is 
associated with a small but detectable amount of satellite DNA. The 
chromosome breakage that separates the neo-centromere from the mouse 
chromosome occurs in the "foreign" DNA region. This is demonstrated by 
the presence of lambda and human DNA at the broken end of the formerly 

30 dicentric chromosome. At both ends of the MMCneo, however, there traces 
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of mouse major satellite DNA occur as evidence by in situ hybridization. 
This observation suggests that the doubling in size of the chromosome 
fragment carrying the neo-centromere during the stabilization of the 
MMCneo is a result of an inverted duplication. Although mouse telomere 
5 sequences, which coamplified with the exogenous DNA sequences during 
the neo-centromere formation, may provide sufficient telomeres for the 
MMCneo, the duplication could have supplied the functional telomeres for 
the minichromosome. 

C. Isolation and partial purification of minichromosomes 

10 Mitotic chromosomes of EC3/7C5 cells were isolated as described by 

Hadlaczky et aL [(1981) Chromosoma 81:537-555], using a 
glycine-hexylene glycol buffer system [Hadlaczky etaL (1 982) Chromosoma 
86:643-659]. Chromosome suspensions were centrifuged at 1 ,200 x g for 
30 minutes. The supernatant containing minichromosomes was centrifuged 

15 at 5,000 x g for 30 minutes and the pellet was resuspended in the 
appropriate buffer. Partially purified minichromosomes were stored in 50% 
glycerol at -20° C. 

D. Stability of the MMCneo maintenance and neo expression 
EC3/7C5 cells grown in non-selective medium for 284 days and then 

20 transferred to selective medium containing 400 //g/ml G41 8 showed a 96% 
plating efficiency (colony formation) compared to control cells cultured 
permanently in the presence of G418. Cytogenetic analysis indicated that 
the MMCneo is stably maintained at one copy per cell both under selective 
and non-selective culture conditions. Only two metaphases with two 

25 MMCneo were found in 2,270 metaphases analyzed. 

Southern hybridization analysis showed no detectable changes in DNA 
restriction patterns, and similar hybridization intensities were observed with 
a neo probe when DNA from cells grown under selective or non-selective 
culture conditions were compared. 
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Northern analysis of RNA transcripts from the neo gene isolated from 
cells grown under selective and non-selective conditions showed only minor 
and not significant differences. Expression of the neo gene persisted in 
EC3/7C5 cells maintained in F-1 2 medium free of G41 8 for 290 days under 
5 non-selective culture conditions. The long term expression of the neo 
gene(s) from the minichromosome may be influenced by the nuclear location 
of the MMCneo. In situ hybridization experiments revealed a preferential 
peripheral location of the MMCneo in the interphase nucleus. In more than 
60% of the 2,500 nuclei analysis, the minichromosome was observed at the 
10 perimeter of the nucleus near the nuclear envelope. 

EXAMPLE 3 

Minichromosome transfer and production of the ,4-neo-chromosome 
A. Minichromosome transfer 

The neo-minichromosome [referred MMCneo, FIG. 2C] has been used 
1 5 for gene transfer by fusion of minichromosome containing cells [EC3/7C5 or 
EC3/7C6] with different mammalian cells, including hamster and human. 
Thirty-seven stable hybrid cell lines have been produced. All established 
hybrid cell lines proved to be true hybrids as evidenced by in situ 
hybridization using biotinylated human, and hamster genomic, or pMCPEI .51 
20 mouse long interspersed repeated DNA probes for "chromosome painting". 

The MMCneo has also been successfully transferred into mouse A9, 
L929 and pluripotent F9 teratocarcinoma cells by fusion of microcells derived 
from EC3/7C5 cells. Transfer was confirmed by PCR, Southern blotting and 
in situ hybridization with minichromosome-specific probes. The cytogenetic 
25 analysis confirmed that, as expected for microcell fusion, a few cells [ 1 -5%] 
received [or retained] the MMCneo. 

These demonstrate that the MMCneo is tolerated by a wide range of 
cells. The prokaryotic genes and the extra dosage for the human and 
lambda sequences carried on the minichromosome seem to be not 
30 disadvantageous for tissue culture cells. 
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The MMCneo is the smallest chromosome of the EC3/7C5 genome 
and is estimated to be approximately 20-30 Mb, which is significantly 
smaller than the majority of the host cell (mouse) chromosomes. By virtue 
of the smaller size, minichromosomes can be partially purified from a 
5 suspension of isolated chromosomes by a simple differential centrifugation. 
In this way, minichromosome suspensions of 1 5-20% purity have been 
prepared. These enriched minichromosome preparations can be used to 
introduce, such as by microinjection or lipofection the minichromosome into 
selected target cells. Target cells include therapeutic cells that can be use 
10 in methods of gene therapy, and also embryonic cells for the preparation of 

transgenic animals. 

The MMCneo is capable of autonomous replication, is stably 
maintained in cells, and permits persistent expression of the neo gene(s), 
even after long-term culturing under non-selective conditions. It is a 

15 non-integrative vector that appears to occupy a territory near the nuclear 
envelope. Its peripheral localization in the nucleus may have an important 
role in maintaining the functional integrity and stability of the MMCneo. 
Functional compartmentalization of the host nucleus may have an effect on 
the function of foreign sequences. In addition, contains megabases of 

20 lambda DNA sequences that should serve as a target site for homologous 
recombination and thus integration of desired gene(s) into the MMCneo. 
It can be transferred by cell and microcell fusion, microinjection, 
electroporation, or chromosome uptake. The neo-centromere of the 
MMCneo is capable of maintaining and supporting the normal segregation of 

25 a larger 1 50-200 Mb /Ineo chromosome. This result (B) demonstrates that 
the MMCneo chromosome should be useful for carrying large fragments of 
heterologous DNA. 
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B. Production of the /1-neo-chromosome 

In one hybrid cell line KE1-2/4 made by fusion of EC3/7 and Chinese 
hamster ovary cells [FIG 2D], the separation of the neo-centromere from the 
dicentric chromosome was associated with a further amplification process. 
5 This amplification resulted in the formation of a stable, chromosome of 
average size [ i.e. , the A neo-chromosome; see, Praznovszky et ah (1991) 
Proc. Natl. Acad. Sci. U.S.A. 88:1 1042-1 1046]. The A neo-chromosome 
carries a terminally located functional centromere, and is composed of seven 
large amplicons containing multiple copiesof A, human, bacterial, and mouse 
10 DNA sequences [see FIG 2]. The amplicons are separated by mouse major 
satellite DNA [Praznovszky et aL (1991) Proc. Natl. Acad. Sci. U.S.A. 
88:11042-11046] which forms narrow bands of constitutive 
heterochromatin between the amplicons. 

EXAMPLE 4 

15 Formation of the "sausage chromosome" [SC] 

The findings set forth in the above EXAMPLES demonstrate that the 
centromeric region of the mouse chromosome 7 has the capacity for large- 
scale amplification [other results indicate that this capacity is not unique to 
chromosome 7]. This conclusion is further supported by results from 

20 cotransfection experiments, in which a second dominant selectable marker 
gene and a non-selected marker gene were introduced into EC3/7 cells 
carrying the formerly dicentric chromosome 7 and the neo-minichromosome. 

The EC3/7C5 cell line was transformed with A phage DNA, a 
hygromycin construct [pH132], and a /8-galactosidase construct [pCH1 10]. 

25 Stable transformants were selected in the presence of high concnetrations 
[400 //g/ml] Hygromycin B, and analysed by Southern hybridization. 
Established transformant cell lines showing multiple copies of integrated 
exogenous DNA were studied by in situ hybridization to localize the 
integration site(s), and by LacZ staining to detect/?-galactosidase expression. 

30 
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A. Materials and methods 

1. Construction of pH132 

The pH1 32 plasmid carries the hygromycin B resistance gene and the 
anti-HIV-1 gag ribozyme [see, SEQ ID NO. 6 for DNA sequence that 
corresponds to the sequence of the ribozyme] under control of the /?-actin 
promoter. This plasmid was constructed from pHyg plasmid [Sugden et aL 
(1985) MoL Cell. Biol. 5:410-413; a gift from Dr. A. D. Riggs, Beckman 
Research Institute, Duarte; see, also, e^, U.S. Patent No. 4,997,764), and 
from pPC-RAG1 2 plasmid [see, Chang et aL (1 990) Clin Biotech 2:23-31; 
provided by Dr. J. J. Rossi, Beckman Research Institute, Duarte; see, also 
U.S. Patent Nos. 5,272,262, 5,149,796 and 5,144,019, which describes 
the anti-HIV gag ribozyme and construction of a mammalian expression 
vector containing the ribozyme insert linked to the /?-actin promoter and 
SV40 late gene transcriptianl termination and polyA signals]. The ribozyme 
insert flanked by Bam HI linkers was inserted into BamHI-digested pH^-Apr- 
1gpt [see, Gunning et aL (1987) Proc. Natl. Acad. Sci. U.S.A. 84:4831- 
4835, see, also U.S. Patent No. 5,144,019]. An EcgRI/Xhgl fragment of 
this vector was inserted into EcoRI/ Xho l-digested pHyg. 

Plasmid pH132 was constructed as follows. First, pPC-RAG12 
[described by Chang etaj. ( 1 990) Clin. Biotech. 2:23-31 ] was digested with 
Bam HI to excise a fragment containing an anti-HIV ribozyme gene [referred 
to as ribozyme D by Chang et aL [(1 990) Clin. Biotech. 2:23-31]; see also 
U.S. Patent No. 5,144,019 to Rossi et aL., particularly Figure 4 of the 
patent] flanked by the human yS-actin promoter at the 5' end of the gene and 
the SV40 late transcriptional termination and polyadenylation signals at the 
3' end of the gene. As described by Chang et aL [(1990) Clin. Biotech. 
2:23-31], ribozyme D is targeted for cleavage of the translational initiation 
region of the HIV gag gene. This fragment of pPC-RAG1 2 was subcloned 
into P Bluescript-KS( + ) [Stratagene, La Jolla, CA] to produce plasmid 132. 
Plasmid 132 was then digested with Xhol and EcoRI to yield a fragment 
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containing the ribozyme D gene flanked by the £-actin promoter at the 5' 
end and the SV40 termination and polyadenylation signals at the 3' end of 
the gene. This fragment was ligated to the largest fragment generated by 
digestion of pHyg [Sugden et aL (1985) MoL CelK B[oL 5:410-413] with 
5 EcoRI and Sail to yield pH132. Thus, pH132 is an -9.3 kb plasmid 
containing the following elements: the /5-actin promoter linked to an anti-HIV 
ribozyme gene followed by the SV40 termination and polyadenylation 
signals, the thymidine kinase gene promoter linked to the hygromycin 
resistance gene followed by the thymidine kinase gene polyadenylation 
10 signal, and the E. coii ColE1 origin of replication and the ampicillin-resistance 
gene. 

The plasmid pHyg [see, e^, U.S. Patent Nos. 4,997,764, 
4,686, 1 86 and 5, 1 62,21 5], which confers resistance to hygromycin B using 
transcriptional controls from the HSV-1 tk gene, was originally constructed 

15 from pKan2 [Yates et aL (1984) Proc. Natl. Acad. Sci. U.S.A. 
81:3806-3810] and pLG89 [see, Gritz et ah (1983) Gene 25:179-188]. 
Briefly pKan2 was digested with Smal and Bglll to remove the sequences 
derived from transposon Tn5. The hygromycin-resistance hph gene was 
inserted into the digested pKan2 using blunt-end ligation at the Snal site and 

20 "sticky-end" ligation [using 1 Weiss unit of T4 DNA ligase (BRL) in 20 
microliter volume] at the Bglll site. The Smal and Bglll sites of pKan2 were 
lost during ligation. 

The resulting plasmid pH1 32, produced from introduction of the anti- 
HIV ribozyme construct with promoter and polyA site into pHyg, includes the 

25 anti-HIV ribozyme under control of the yff-actin promoter as well as the Hyg 
gene under control of the TK promoter. 
2. Chromosome banding 

Trypsin G-banding of chromosomes was performed as described in 
EXAMPLE 1. 
30 3. Cell cultures 
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TF1004G19 and TF1004G19C5 mouse cells and the 19xHa4 hybrid, 
described below, and its sublines were cultured in F-12 medium containing 
400 £/g/ml Hygromycin B [Calbiochem]. 

B. Cotransfection of EC3/7C5 to produce TF1004G-19 

5 Cotransfection of EC3/7C5 cells with plasmids [pH132, pCH110 

available from Pharmacia, see, also Hall et al_ (1983) J. Mol. Appl. Gen. 
2:101-109] and with A DNA [Ac\ 875 Sam 7(New England Biolabs)] was 
made using the calcium phosphate DNA precipitation method [see, e.g. , 
Chen etal. (1987) Mol. Cell. Biol. 7:2745-2752], using 2-5 jjg plasmid DNA 
10 and 20 pg A phage DNA per 5 x 10 6 recipient cells. 

C. Cell lines containing the sausage chromosome 

One transformant TF1004G-19 was identified. It has a high copy 
number of integrated pH1 32 and pCH1 10 sequences, and a high level of /?- 
galactosidase expression. G-banding and in situ hybridization with a human 

15 probe [CM8; see, e.g., U.S. application Serial No. 08/375,271] revealed 
unexpectedly that integration had occurred in the formerly dicentric 
chromosome 7 of the EC3/7C5 cell line. Furthermore, this chromosome 
carried a newly formed heterochromatic chromosome arm. The size of this 
heterochromatic arm varied between —150 and —800 Mb in individual 

20 metaphases. 

By single cell cloning from the TF1004G-19 cell line, a subclone 
TF1004G-19C5 [FIG 2D], which carries a stable chromosome 7 with a 
— 100-150 Mb heterochromatic arm [ the sausage chromosome] was 
obtained. This cell line has been deposited in the ECACC under Accession 

25 No. 96040926. This chromosome arm is composed of four to five satellite 
segments rich in satellite DNA, and evenly spaced integrated heterologous 
"foreign" DNA sequences. At the end of the compact heterochromatic arm 
of the sausage chromosome, a less condensed euchromatic terminal 
segment is regularly observed. This subclone was used for further 

30 analyses. 
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D. Demonstration that the sausage chromosome is derived from the 
formerly dicentric chromosome 

In situ hybridization with lambda phage and pH132 DNA on the 
TF1004G-19C5 cell line showed positive hybridization only on the 
5 minichromosome and on the heterochromatic arm of the "sausage" 
chromosome [Fig. 2D]. It appears that the "sausage" chromosome [herein 
also referred to as the SC] developed from the formerly dicentric 
chromosome (FD) of the EC3/7C5 cell line. 

To establish the, the integration sites of pCH1 1 0 and pH1 32 plasmids 
10 was determined. This was accomplished by in situ hybridization on these 
cells with biotin-labeled subf ragments of the hygromycin resistance gene and 
the £-galactosidase gene. Both experiments resulted in narrow hybridizing 
bands on the heterochromatic arm of sausage chromosome. The same 
hybridization pattern was detected on the sausage chromosome using a 
15 mixture of biotin-labeled A probe and pH132 plasmid, proving the 
cointegration of lambda phages, pH132 and pCH110 plasmids. 

To examine this further, the cells were cultured in the presence of the 
DNA-binding dye Hoechst 33258. Culturing of mouse cells in the presence 
of this dye results in under-condensation of the pericentric heterochromatin 
20 of metaphase chromosomes, thereby permitting better observation of the 
hybridization pattern. Using this technique the heterochromatic arm of 
sausage chromosome of 19C5 cell showed regular under-condensation 
revealing the details of the structure of "sausage" chromosome by in situ 
hybridization. Results of in situ hybridization on Hoechst-treated 
25 TF1004G/19C5 cells with biotin-labeled subfragments of hygromycin 
resistance and £-galactosidase genes shows that these genes are localized 
only in the heterochromatic arm of the sausage chromosome. In addition an 
equal banding hybridization pattern was observed. This pattern of repeating 
units [amplicons] clearly indicates that the sausage chromosome was formed 
30 by an amplification process and that the lambda phage, pH 1 32 and pCH 1 1 0 
plasmid DNA sequences border the amplicons. 
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In another series of experiments, using fluorescence in situ 
hybridization [FISH] was carried out with mouse major satellite DNA, the 
main component of the mouse pericentric heterochromatin, the results 
confirmed that the amplicons of the sausage chromosome are primarily 
5 composed of satellite DNA. 

E. The sausage chromosome has one centromere 

To determine whether mouse centromeric sequences had participated 
in the amplification process forming the "sausage" chromosome and whether 
or not the amplicons carry inactive centromeres, in situ hybridization was 
10 carried out with mouse minor satellite DNA. Mouse minor satellite DNA is 
localized specifically near the centromeres of all mouse chromosome. 
Positive hybridization was detected in all mouse centromeres including the 
sausage chromosome, which, however, only showed a positive signal at the 
beginning of the heterochromatic arm. 
15 Indirect immunofluorescence with human anti-centromere antibody 

[LU 851] that only which can recognizes the functional centromeres [see, 
e.g. , Hadlaczky et aL ( 1 989) Chromosoma 97:282-288] proved that sausage 
chromosome has only one active centromere. The centromere comes from 
the formerly dicentric part of the chromosome and co-localizes with the in 
20 situ hybridization signal of the mouse minor DNA probe. 

F. The selected and non-selected heterologous DNA in the 
heterocrhomatin of the sausage chromosome is expressed 

1 . High levels of the heterologous genes are expressed 

The TF1004G/19C5 cell line thus carries multiple copies of 
25 hygromycin resistance and /?-galactosidase genes localized only in the 
heterochromatic arm of the sausage chromosome. The 1 9C5 cells can grow 
very well in the presence of 200 jug/ml or even 400 //g/ml hygromycin B. 
(The level of expression was determined by Northern hybridization with 
subfragment of hygromycin resistance gene and single copy gene). 
30 The expression of the non-selected £-galactosidase gene in the 

TF1 004G/1 9C5 transformant, was detected with LacZ staining of the cells. 
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By this method one hundred percent of the cells stained dark blue, showing 
that there is a high level of /?-galactosidase expression in all of 19C5 cells. 
2. The heterologous genes that are expressed are in the 
heterochromatin 

To demonstrate that the genes localized in the constitutive 
heterochromatin of the sausage chromosome provide the hygromycin 
resistance and the LacZ staining capability of 1 9C5 transformant [Le^-gal 
express], PEG induced cell fusion between a TF1004G/19C5 mouse cell and 
Chinese hamster ovary cell was performed. The hybrids were selected and 
maintained in HAT medium containing G41 8 [400 jug/ml] and hygromycin 
[200 //g/ml]. Two hybrid clones designated 1 9C5xHa3 and 19C5xHa4, 
which has been deposited in the ECACC under Accession No. 96040927, 
were selected. Both carry the sausage chromosome and the 
minichromosome. 

Twenty-seven single cell derived colonies of 19C5xHa4 hybrid were 
maintained and analyzed as individual subclones. In situ hybridization with 
hamster and mouse chromosome painting probes and hamster chromosome 
2 specific probes verified that the 19C5xHa4 clone contains the complete 
Chinese hamster genome and a partial mouse genome. All 19C5xHa4 
subclones retained the hamster genome, but different subclones showed 
different number of mouse chromosomes indicating the preferential 
elimination of mouse chromosomes. 

To promote further elimination of mouse chromosomes hybrid cells 
were repeatedly treated with BrdU. The BrdU treatments, which destabilize 
the genome, result in significant loss of mouse chromosomes. The BrdU 
treated 1 9C5xHa4 hybrid cells were divided to three groups. One group of 
the hybrid cells (GH) were maintained in the presence of hygromycin (200 
//g/ml) and G418 (400 //g/ml), and the other two groups of the cells were 
cultured under G418 (G) or hygromycin (H) selection conditions to promote 
the elimination of sausage or minichromosome. 
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One month later, single cell derived subclones were established from 
these three subcultures of the TF1 004G/1 9C5xHa4 hybrid line. The 
subclones were monitored by in situ hybridization with biotin-labeled lambda 
phage and hamster chromosome painting probes. Four individual clones 
5 [G2B5, G3C5, G4D6, G2B4] selected in the presence of G41 8 that have lost 
the sausage chromosome but retained minichromosome were found. Under 
hygromycin selection only one subclone [H1D3] lost the minichromosome. 
In this clone the sausage chromosome was present. 

Since hygromycin resistance and £-galactosidase genes were thought 
10 to be expressed from the sausage chromosome, the expression of these 
genes were analyzed in the four subclones that had lost the sausage 
chromosome. In the presence of 200 /zg/ml hygromycin, one hundred 
percent of the cells of four individual subclones died. In order to detect the 
/?-galactosidase expression hybrid subclones were analyzed by LacZ staining. 
1 5 One hundred percent of the cells of the four subclones that lost the sausage 
chromosome also lost the LacZ staining capability. All of the other hybrid 
subclones that not lost the sausage chromosome under the non-selective 
culture conditions showed positive LacZ staining. 

These findings demonstrate that the expression of hygromycin 
20 resistance and £-galactosidase genes is linked to the presence of the 
sausage chromosome and results of in situ hybridizations show that the 
heterologous DNA is expressed from the constitutive heterochromatin of the 
sausage chromosome. 

By in situ hybridization in three other hybrid subclones [G2C6, G2D1 , 
25 and G4D5] the presence of the sausage chromosome was not detected. By 
the LacZ staining method some stained cells were detected in these hybrid 
lines and when these subclones were transferred to hygromycin selection 
some colonies survived. Cytological analysis and in situ hybridization of 
these hygromycin resistance colonies revealed the presence of the sausage 
30 chromosome, suggesting that only the cells of G2C6, G2D1 and G4D5 
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hybrids that had not lost the sausage chromosome were able to preserve the 
hygromycin resistance and yS-galactosidase expression. These results 
confirmed that the expression of these genes is linked to the presence of the 
sausage chromosome. The level of £-galactosidase expression was 

5 determined by the immunoblot technique using monoclonal antibody. 

Hygromycin resistance and yS-galactosidase expression of the cells 
which contained the sausage chromosome were provided by the genes 
localized in the mouse pericentric heterochromatin. This was demonstrated 
by performing Southern DNA hybridizations on the hybrid cells that lack the 

10 sausage chromosome with PCR amplified subfragments of hygromycin 
resistance and yS-galactosidase genes. None of the subclones showed 
hybridization with these probes, however, all of the analyzed clones 
contained the minichromosome as well. Other hybrid clones that contain the 
sausage chromosome showed intense hybridization with these DNA probes. 

15 These results lead to the conclusion that hygromycin resistance and 0- 
galactosidase expression of the cells that contain the sausage chromosome 
were provided by the genes localized in the mouse pericentric 
heterochromatin. 

EXAMPLE 5 

20 The gigachromosome 

As described in Example 4, the sausage chromosome was transferred 
into Chinese hamster cells by cell fusion. Using Hygromycin B/HAT 
selection, two hybrid clones 1 9C5xHa3 and 1 9C5xHa4 were produced that 
carry sausage chromosome. In situ hybridization, using hamster and mouse 

25 chromosome-painting probes and a hamster chromosome 2 specific probe, 
verified that clone 1 9C5xHa4 contains a complete Chinese hamster genome 
as well as partial mouse genomes. Twenty-seven separate colonies of 
19C5xHa4 cells were maintained and analyzed as individual subclones. 
Twenty-six out of 27 subclones contained a morphologically unchanged 

30 sausage chromosome. 
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In one clone 1 9C5xHa47 [see Fig. 2E], the heterochromatic arm of the 
sausage chromosome became unstable and showed continuous 
intrachromosomal growth. In extreme cases, the amplified chromosome arm 
exceeded 1000 Mb in size (gigachromosome). 
5 EXAMPLE 6 

The stable megachromosome 
A. Formation of the megachromosome 

All 19C5xHa4 subclones retained a complete hamster genome, but 
different subclones showed different numbers of mouse chromosomes, 
1 0 indicating the preferential elimination of mouse chromosomes. As described 
in Example 4, to promote further elimination of mouse chromosomes, hybrid 
cells were repeatedly treated with 10" 4 M BrdU for 16 hours and single cell 
subclones were established. The BrdU treatments appeared to destabilize 
the genome, resulting in a change in the sausage chromosome as well. A 
15 gradual increase in a cell population in which a further amplification had 
occurred was observed. In addition to the ~ 1 00-1 50 Mb heterochromatic 
arm of the sausage chromosome, an extra centromere and a ~ 1 50-250 Mb 
heterochromatic chromosome arm were formed, which differed from those 
of mouse chromosome 7. By the acquisition of another euchromatic 

20 terminal segment, a new submetacentric chromosome (megachromosome) 
was formed. Seventy-nine individual subclones were established from these 
BrdU-treated cultures, by single-cell cloning: 42 subclones carried the intact 
megachromosome, 5 subclones carried the sausage chromosome, and in 32 
subclones fragments or translocated segments of the megachromosome 
25 were observed. Twenty-six subclones were cultured under non-selective 
conditions over a 2 month period. In 19 out of 26 subclones, the 
megachromosome was retained. Those subclones which lost the 
megachromosomes all became sensitive to Hygromycin B and had no /?- 
galactosidase expression, indicating that both markers were linked to the 
30 megachromosome. 
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Two sublines (G3D5 and H1D3), which were chosen for further 
experiments, showed no changes in the morphology of the 
megachromosome during more than 100 generations under selective 
conditions. 

5 B. Structure of the megachromosome 

The following results demonstrate that, apart from the euchromatic 
terminal segments, the whole megachromosome is constitutive 
heterochromatin, containing a tandem array of at least 40 [-7.5 Mb] 
blocks of mouse major satellite DNA [see Figures 2 and 3]. Four satellite 
10 DNA blocks are organized into a giant palindrome [amplicon] carrying 
integrated exogenous DNA sequences at each end. The long and short 
arms of the submetacentric megachromosome contains 6 and 4 
amplicons, respectively. 

1. The megachromosome is composed primarily of 
1 5 heterochromatin 

Except for the terminal regions, the megachromosome is composed 
primarily of heterochromatin. This was demonstrated by G-banding of the 
megachromosome,, which resulted in positive staining characteristic of 
constitutive heterochromatin. Apart from the terminal regions, the whole 
20 megachromosome appears to be heterochromatic. Mouse major satellite 
DNA is the main component of the pericentric, constitutive heterochromatin 
of mouse chromosomes and represents -10% of the total DNA [Waring et 
aL(1966) Science 154 :791-794]. Using a mouse major satellite DNA probe 
for in situ hybridization, strong hybridization was observed throughout the 
25 megachromosome, except for its terminal regions. The hybridization showed 
a segmented pattern: four large blocks appeared on the short arm and 
usually 4-7 blocks were seen on the long arm. By comparing these 
segments with the pericentric regions of normal mouse chromosomes that 
carry approximately -15 Mb of major satellite, the size of the blocks of 
30 major satellite on the megachromosome was estimated to be -30 Mb. 
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Using a mouse probe specific to euchromatin [pMCPEI .51 ; a mouse 
long interspersed repeated DNA probe], positive hybridization was detected 
only on the terminal segments of the megachromosome of the H1 D3 hybrid 
subline. In the G3D5 hybrids, hybridization with a hamster-specific probe 
5 revealed that several megachromosomes contained terminal segments of 
hamster origin on the long arm. This observation indicated that the 
acquisition of the terminal segments on these chromosomes happened in the 
hybrid cells, and that the long arm of the megachromosome was the recently 
formed one. When a mouse minor satellite probe was used, specific 

10 to the centromeres of mouse chromosomes [Wong et al. (1 988) Nucl. Acids 
Res. 16 :11645-11661], a strong hybridization signal was detected only at 
the primary constriction of the megachromosome, which colocalized with the 
positive immunofluorescence signal produced with human anti-centromere 
serum [LU 851]. 

15 In situ hybridization experiments with pH132, pCH1 10, and A DNA 

probes revealed that all heterologous DNA was located in the gaps between 
the mouse major satellite segments. Each segment of mouse major satellite 
was bordered by a narrow band of integrated heterologous DNA, except at 
the second segment of the long arm where a double band existed, indicating 

20 that the major satellite segment was missing or considerably reduced in size 
here. This chromosome region served as a useful cytological marker in 
identifying the long arm of the megachromosome. At a frequency of 10- 4 , 
"restoration" of these missing satellite DNA blocks was observed in one 
chromatid, when the formation of a whole segment on one chromatid 

25 occurred. 

After Hoechst 33258 treatment (50 //g/ml for 16 hours), the 
megachromosome showed undercondensation throughout its length except 
for the terminal segments. This made it possible to study the architecture 
of the megachromosome at higher resolution. In situ hybridization with the 
30 mouse major satellite probe on undercondensed megachromosomes 
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demonstrated that the —30 Mb major satellite segments were composed of 
four blocks of —7.5 Mb separated from each other by a narrow band of non- 
hybridizing sequences [Figure 3]. Similar segmentation can be observed in 
the large block of pericentric heterochromatin in metacentric mouse 
5 chromosomes from the LMTK and A9 cell lines. 

2. The megachromosome is composed of segments containing 
two tandem — 7.5 Mb blocks are followed by two inverted 
blocks 

Because of the asymmetry in thymidine content between the two 

10 strands of the DNA of the mouse major satellite, when mouse cells are 
grown in the presence of BrdU for a single S phase, the constitutive 
heterochromatin shows lateral asymmetry after FPG staining. Also, in the 
19C5xHa4 hybrids, the thymidine-kinase the [Tk] deficiency of the mouse 
fibroblast cells was complemented by the hamster Tk gene, permitting BrdU 

15 incorporation experiments. 

A striking structural regularity in the megachromosome was detected 
using the FPG technique. In both chromatids, alternating dark and light 
staining that produced a checkered appearance of the megachromosome 
was observed. A similar picture was obtained by labelling with fluorescein- 

20 conjugated anti-BrdU antibody. Comparing these pictures to the segmented 
appearance of the megachromosome, showed that one dark and one light 
FPG band corresponded to one —30 Mb segment of the megachromosome. 

These results suggest that the two halves of the —30 Mb segment 
have an inverted orientation. This was verified by combining in situ 

25 hybridization and immunolabelling of the incorporated BrdU with fluorescein- 
conjugated anti-BrdU antibody on the same chromosome. Since the —30 
Mb segments of the megachromosome are composed of four blocks of 
mouse major satellite, it can be concluded that two tandem — 7.5 Mb blocks 
are followed by two inverted blocks within one segment. 
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Large-scale mapping of megachromosome DNA by pulsed-field 
electrophoresis and Southern hybridization with "foreign" DNA probes 
revealed a simple pattern of restriction fragments. Using endonucleases with 
none, or only a single cleavage site in the integrated foreign DNA sequences, 
5 followed by hybridization with a hyg probe, 1 -4 predominant fragments were 
detected. Since the megachromosome contains 10-12 amplicons with an 
estimated 3-8 copies of hyg sequences per amplicon (30-90 copies per 
megachromosome), the small number of hybridizing fragments indicates the 
homogeneity of DNA in the amplified segments. 
10 3. Scanning electron microscopy of the megachromosome 

confirmed the above findings 
The homogeneous architecture of the heterochromatic arms of the 
megachromosome was confirmed by high resolution scanning electron 
microscopy. Extended arms of megachromosomes, and the pericentric 
15 heterochromatic region of mouse chromosomes, treated with Hoechst 
33258, showed similar structure. The constitutive heterochromatic regions 
appeared more compact than the euchromatic segments. Apart from the 
terminal regions, both arms of the megachromosome were completely 
extended, and showed faint grooves, which should correspond to the border 
20 of the satellite DNA blocks in the non-amplified chromosomes and in the 
megachromosome. Without Hoechst treatment, the grooves seemed to 
correspond to the amplicon borders on the megachromosome arms. In 
addition, centromeres showed a more compact, finely fibrous appearance 
than the surrounding heterochromatin. 
25 C. Formation of the megachromosome 

Figure 2 schematically sets forth the events leading to the formation 
of the stable megachromosome: (A) A single E-type amplification in the 
centromeric region of chromosome 7 generates the neo-centromere linked 
to the integrated foreign DNA, and forms a dicentric chromosome. Multiple 
30 E-type amplification forms the A neo-chromosome, which was derived from 
chromosome 7 and stabilized in a mouse-hamster hybrid cell line; (B) Specific 
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breakage between the centromeres of a dicentric chromosome 7 generates 
a chromosome fragment with the neo-centromere, and a chromosome 7 with 
traces of foreign DNA at the end; (C) Inverted duplication of the fragment 
bearing the neo-centromere results in the formation of a stable neo- 
5 minichromosome; (D) Integration of exogenous DNA into the foreign DNA 
region of the formerly dicentric chromosome 7 initiates H-type amplification, 
and the formation of a heterochromatic arm. By capturing a euchromatic 
terminal segment, this new chromosome arm is stabilized in the form of the 
"sausage" chromosome; (E) BrdU treatment and/or drug selection appears 

10 to induce further H-type amplification, which results in the formation of an 
unstable gigachromosome: (F) Repeated BrdU treatments and/or drug 
selection induce further H-type amplification including a centromere 
duplication, which leads to the formation of another heterochromatic 
chromosome arm. It is split off from the chromosome 7 by chromosome 

15 breakage and acquires a terminal segment to form the stable 
megachromosome. 

EXAMPLE 7 

Summary of some of the cell lines with SAT ACS and minichromosomes that 
have been constructed 

20 LMTK-derived cell line, which is a mouse fibroblast cell line, was 

transfected with /CM8 and /IgtWESneo DNA [see, EXAMPLE 2] to produce 
transformed cell lines. Among these cell lines was EC3/7, deposited at the 
European Collection of Animal cell Culture (ECACC) under Accession No. 
90051001 [see, U.S. Patent No. 5,288,625; see, also Hadlaczky et aL 

25 (1991) Proc. Natl. Acad. Sci. U.S.A. 88 :8106-8110 and U.S. application 
Serial No. 08/375,271]. This cell line contains the dicentric chromosome 
with the neo centromere. Recloning and selection produced cell lines such 
as EC3/7C5, which are cell lines with the stable neo-minichromosome [see. 
Fig. 2C]. 

30 Fusion of EC3/7 with CHO-K20 cells and selection with G41 8/HAT 

produced hybrid cell lines, among these was KE1 2/4, which has been 
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deposited with the ECACC under Accession No. 96040924. KE1 2/4 is a 
stable cell line that contains the A neo-chromosome [see, Fig. 2D; see, also 
U.S. Patent No. 5,288,625], produced by E type amplifications. KE1 2/4 
has been transfected with vectors containing lambda DNA, selectable 
5 markers, such as puromycin resistance, and genes of interest, such as p53, 
anti-HIV ribozymes. These vectors target the gene of interest into the A neo- 
chromosome by virtue of homologous recombination with the heterologous 
DNA in the chromosome. 

The EC3/7C5 cell line has been co-transfected with pH132, pCH1 10 
10 and A DNA [see, EXAMPLE 2] as well as other constructs. Various clones 
and subclones have been selected. For example transformation with a 
construct that includes p53 encoding DNA, produced cells designated 
C5pMCT53. 

As discussed above, cotransfection of EC3/7C5 cells with plasmids 
15 [pH132, pCH1 10 available from Pharmacia, see, also Hall et aL (1983) ± 
Mol. AppI. Gen. 2: 1 01 -1 09] and with A DNA [Ac\ 875 Sam 7(New England 
Biolabs)] produced transformed cells. Among these is TF1004G24, which 
contains the DNA encoding the anti-HIV ribozyme in the neo- 
minichromosome. Recloning of TF1 004G24 produced numerous cell lines. 
20 Among these the NHHL24 cell line. This cell line also has the anti-HIV 
ribozyme in the neo-minichromosome and expresses high levels of 0-gal. It 
has been fused with CHO-K20 cells to produce various hybrids. 

Recloning and selection of the TF1004G transformants produced the 
cell line TF1004G-19, discussed above in EXAMPLE 4, which contains the 
25 unstable sausage chromosome. Single cell cloning produced the TF1004G- 
19C5 [see Figure 4] cell line, which has a stable sausage chromosome. 
TF1004G-19C5 has been fused with CHO cells and the hybrids grown 
under selective conditions to produce the 1 9C5xHa4cell line [see, EXAMPLE 
4] and others. BrdU treatment of 19C5xHa4 cells and growth under 
30 selective conditions [neomycin (G) and/or hygromycin (H)] as produced 
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hybrid cell lines with the gigachromsome [see Figure 2E] and the G3D5 and 
G4D6 cell lines and others. G3D5 has the neo-minichromosome and the 
megachromosome, G4D6 has only the neo-minichromosome. 

Recloning of G3D5 in GH medium produced numerous clones. Among 
5 these is H1D3 [see Figure 4], which has the stable megachromosome. 
Repeated BrDU treatment and recloning has produced the HB31 cell line, 
which has been used for transformations with the pTEMPUD, pTEMPU and 
pTEMPU3 vectors [see, Example 12, below]. 

H1D3 has been fused with a CD4 + Hela cell line that carries DNA 
10 encoding CD4 and neomycin resistance on a plasmid [see, e^, U.S. Patent 
Nos. 5,413,914, 5,409,810, 5,266,600, 5,223,263, 5,215,914 and 
5,144,019, which describe these Hela cells]. Selection with GH has 
produced hybrids, including H1xHE41 [see Figure 4], which carries the 
megachromosome and also a single human chromosome that includes the 
15 CD4neo construct. Repeated BrdU treatment and single cell cloning has 
produced cell lines with the megachromosome [cell line 1 B3, see Figure 4], 
cell lines, such as 1 B4 and others that have a dwarf megachromosome 
[-150-200 Mb] and cell lines , such as 1C3, which has micro- 
megachromosome [-60-90 Mb]. About 25% of the 1 B3 cells have a 
20 truncated megachromosome [-90-120 Mb]. 

EXAMPLE 8 
Replication of the megachromosome 

This homogeneous architecture of the megachromomes provides a 
unique opportunity to perform a detailed analysis of the replication of the 
25 constitutive heterochromatin. 
A. Materials and methods 

1 . Culture of cell lines 

H1D3 mouse-hamster hybrid cells carrying the megachromosome [see, 
EXAMPLE 4] were cultured in F-1 2 medium containing 1 0% fetal calf serum 
30 [FCS] and 400 /L/g/ml Hygromycin B [Calbiochem]. G3D5 hybrid cells [see. 
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Example 4] were maintained in F-12 medium containing 10% FCS, 400 
//g/ml Hygromycin B (Calbiochem), and 400/ig/ml G41 8[SIGMA]. Mouse A9 
fibroblast cells were cultured in F-1 2 medium supplemented with 1 0% FCS. 
2. BrdU labelling 
5 In typical experiments, 20-24 parallel semi-confluent cell cultures were 

set up in 10 cm Petri dishes. Bromodeoxyuridine (BrdU) (Fluka) was 
dissolved in distilled water alkalized with a drop of NaOH, to make a 10' 2 M 
stock solution. Aliquots of 10-50^1 of this BrdU stock solution were added 
to each 10 ml culture, to give a final BrdU concentration of 10-50/vM. The 
10 cells were cultured in the presence of BrdU for 30 min, and then washed 
with warm complete medium, and incubated without BrdU until required. At 
this point, 5 /yg/ml colchicine was added to a sample culture every 1 or 2 h. 
After 1-2 h colchicine treatment, mitotic cells were collected by "shake-off" 
and regular chromosome preparations were made for immunolabelling. 
! 5 3. Immunolabelling of chromosomes and in situ hybridization 

Immunolabelling with fluorescein-conjugated anti-BrdU monoclonal 
antibody (Boehringer) was done according to the manufacturer's 
recommendations, except that for mouse A9 chromosomes, 
2 M hydrochloric acid was used at 37° C for 25 min, while for chromosomes 
20 of hybrid cells, 1 M hydrochloric acid was used at 37° C for 30 min. In situ 
hybridization with biotin-labelled probes, and indirect immunofluorescence 
and in situ hybridization on the same preparation, were performed as 
described previously [Hadlaczky et ak d 991 ) Proc. Natl. Acad. Sci. U.S.A. 
88:8106-8110, see, also U.S. Patent No. 5,288,625]. 
25 4. Microscopy 

All observations and microphotography were made by using a Vanox 
AHBS (Olympus) microscope. Fujicolor 400 Super G or Fujicolor 1 600 Super 
HG high-speed colour negatives were used for photographs. 
B. Results 
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The replication of the megachromosome was analyzed by BrdU 
pulse labelling followed by immunolabelling. The basic parameters for 
DNA labelling in vivo were first established. Using a 30 min pulse of 
50 fjM BrdU in parallel cultures, samples were taken and fixed at 5 min 
intervals from the beginning of the pulse, and every 15 min up to 1 h 
after the removal of BrdU. Incorporated BrdU was detected by 
immunolabelling with fluorescein-conjugated anti-BrdU monoclonal 
antibody. At the first time point (5 min) 38% of the nuclei were labelled, 
and a gradual increase in the number of labelled nuclei was observed 
during incubation in the presence of BrdU, culminating in 46% in the 30 
min sample, at the time of the removal of BrdU. At further time points 
(60, 75, and 90 min) no significant changes were observed, and the 
fraction of labelled nuclei remaining constant [44.5-46%]. 

These results indicate that (i) the incorporation of the BrdU is a 
rapid process, (ii) the 30 min pulse-time is sufficient for reliable labelling 
of S-phase nuclei, and (iii) the BrdU can be effectively removed from the 
cultures by washing. 

The length of the cell cycle of the 1 9C5xHa4 hybrid cells was 
estimated by measuring the time between the appearance of the earliest 
BrdU signals on the extreme late replicating chromosome segments and 
the appearance of the same pattern only on one of the chromatids of the 
chromosomes after one completed cell cycle. The length of G2 period 
was determined by the time of the first detectable BrdU signal on 
prophase chromosomes and by the labelled mitoses method [Qastler et al 
(1959) Exp. Cell Res. 17:420-438], The length of the S-phase was 
determined in three ways: (i) on the basis of the length of cell cycle and 
the fraction of nuclei labelled during the 30-120 min pulse; (ii) by 
measuring the time between the very end of the replication of the 
extreme late replicating chromosomes and the detection of the first signa 
on the chromosomes at the beginning of S phase; (iii) by the labelled 
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mitoses method. In repeated experiments, the duration of the cell cycle 
was found to be 22-26 h, the S phase 10-14 h, and the G2 phase 3.5- 
4.5 h. 

Analyses of the replication of the megachromosome were made in 
5 parallel cultures by collecting mitotic cells at two hour intervals following 
two hours of colchicine treatment. In a repeat experiment, the same 
analysis was performed using one hour sample intervals and one hour 
colchicine treatment. Although the two procedures gave comparable 
results, the two hour sample intervals were viewed as more appropriate 

10 since approximately 30% of the cells were found to have a considerably 
shorter or longer cell cycle than the average. The characteristic 
replication patterns of the individual chromosomes, especially some of the 
late replicating hamster chromosomes, served as useful internal markers 
for the different stages of S-phase. To minimize the error caused by the 

15 different lengths of cell cycles in the different experiments, samples were 
taken and analyzed throughout the whole cell cycle until the appearance 
of the first signals on one chromatid at the beginning of the second S- 
phase. 

The sequence of replication in the megachromosome is as follows. 

20 At the very beginning of the S-phase, the replication of megachromosome 
starts at the ends of the chromosomes. The first initiation of replication 
in an interstitial position can usually be detected at the centromeric 
region. Soon after, but still in the first quarter of the S-phase, when the 
terminal region of the short arm has almost completed its replication, 

25 discrete initiation signals appear along the chromosome arms. In the 

second quarter of the S-phase, as replication proceeds, the BrdU-labelled 
zones gradually widen, and the checkered pattern of the 
megachromosome becomes clear [see, e^, Fig. 2F]. At the same time, 
pericentric regions of mouse chromosomes also show intense 

30 incorporation of BrdU. The replication of the megachromosome peaks at 
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the end of the second quarter and in the third quarter of the S-phase. At 
the end of the third quarter, and at the very beginning of the last quarter 
of the S-phase, the megachromosome and the pericentric heterochromatin 
of the mouse chromosomes complete their replication. By the end of S- 
5 phase only the very late replicating segments of mouse and hamster 
chromosomes are still incorporating BrdU. 

The replication of the whole genome occurs in distinct phases. The 
signal of incorporated BrdU increased continuously until the end of the 
first half of the S-phase, but at the beginning of the third quarter of the S- 
10 phase chromosome segments other than the heterochromatic regions 
hardly incorporated BrdU. In the last quarter of the S-phase, the BrdU 
signals increased again when the extreme late replicating segments 
showed very intense incorporation. 

Similar analyses of the replication in mouse A9 cells were 
15 performed as controls. To increase the resolution of the immunolabelling 
pattern, pericentric regions of A9 chromosomes were decondensed by 
treatment with Hoechst 33258. Because of the intense replication of the 
surrounding euchromatic sequences, precise localization of the initial BrdU 
signal in the heterochromatin was normally difficult, even on 
20 undercondensed mouse chromosomes. On those chromosomes where 

the initiation signal(s) were localized unambiguously, the replication of the 
pericentric heterochromatin of A9 chromosomes was similar to that of the 
megachromosome. Chromosomes of A9 cells also exhibited replication 
patterns and sequences similar to those of the mouse chromosomes in 
25 the hybrid cells. These results indicate that the replicators of the 

megachromosome and mouse chromosomes retained their original timing 
and specificity in the hybrid cells. 

By comparing the pattern of the initiation sites obtained after BrdU 
incorporation with the location of the integration sites of the "foreign" 
30 DNA in a detailed analysis of the first quarter of the S-phase, an attempt 
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was made to identify origins of replication (initiation sites) in relation to 
the amplicon structure of the megachromosome. The double band of 
integrated DNA on the long arm of the megachromosome served as a 
cytological marker. The results showed a colocalization of the BrdU and 
in situ hybridization signals found at the cytological level, indicating that 
the "foreign" DNA sequences are in close proximity to the origins of 
replication, presumably integrated into the non-satellite sequences 
between the replicator and the satellite sequences [see, Figure 3]. In the 
pericentric region of several other chromosomes, dot-like BrdU signals can 
also be observed that are comparable to the initiation signals on the 
megachromosome. These signals may represent similar initiation sites in 
the heterochromatic regions of normal chromosomes. 

At a frequency of 10" 4 , "uncontrolled" amplification of the 
integrated DNA sequences was observed in the megachromosome. 
Consistent with the assumption (above) that "foreign" sequences are in 
the proximity to the replicators, this spatially restricted amplification is 
likely to be a consequence of uncontrolled repeated firings of the 
replication origin(s) without completing the replication of the whole 
segment. 
C. Discussion 

It has generally been thought that the constitutive heterochromatin 
of the pericentric regions of chromosomes is late replicating [see, e^, 
Miller (1976) Chromosoma 55:1 65-1 70]. On the contrary, these 
experiments evidence that the replication of the heterochromatic blocks 
starts at a discrete initiation site in the first half of the S-phase and 
continues through approximately three-quarters of S-phase. This 
difference can be explained in the following ways: (i) in normal 
chromosomes, actively replicating euchromatic sequences that surround 
the satellite DNA obscure the initiation signals, and thus the precise 
localization of initiation sites is obscured; (ii) replication of the 
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heterochromatin can only be detected unambiguously in a period during 
the second half of the S-phase, when the bulk of the heterochromatin 
replicates and most other chromosomal regions have already completed 
their replication, or have not yet started it. Thus, low resolution 
5 cytological techniques, such as analysis of incorporation of radioactively 
labelled precursors by autoradiography, only detect prominent replication 
signals in the heterochromatin in the second half of S-phase, when 
adjacent euchromatic segments are no longer replicating. 

In the megachromosome, the primary initiation sites of replication 
10 colocalize with the sites where the "foreign" DNA sequences are 
integrated at the amplicon borders. Similar initiation signals were 
observed at the same time in the pericentric heterochromatin of mouse 
chromosomes that do not have "foreign" DNA, indicating that the 
replication initiation sites at the borders of amplicons may reside in the 
15 non-satellite flanking sequences of the satellite DNA blocks. The 

presence of a primary initiation site at each satellite DNA doublet implies 
that this large chromosome segment is a single huge unit of replication 
[megareplicon] delimited by the primary initiation site and the termination 
point at each end of the unit. Several lines of evidence indicate that, 
20 within this higher-order replication unit, "secondary" origins and replicons 
contribute to the complete replication of the megareplicon: 

1 . The total replication time of the heterochromatic regions of the 
megachromosome was -9-1 1 h. At the rate of movement of replication 
forks, 0.5-5 kb per minute, that is typical of eukaryotic chromosomes 
25 [Kornberg et ak (1992) DNA Replication. 2nd. ed.., New York: W.H. 

Freeman and Co, p. 474], replication of a -15 Mb replicon would require 
50-500 h. Alternatively, if only a single replication origin was used, the 
average replication speed would have to be 25 kb per minute to complete 
replication within 10 h. By comparing the intensity of the BrdU signals on 
30 the euchromatic and the heterochromatic chromosome segments, no 
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evidence for a 5- to 50-fold difference in their replication speed was 
found. 

2. Using short BrdU pulse labelling, a single origin of replication 
would produce a replication band that moves along the replicon, reflecting 
the movement of the replication fork. In contrast, a widening of the 
replication zone that finally gave rise to the checkered pattern of the 
megachromosome and within the replication period the most intensive 
BrdU incorporation occurred in the second half of the S-phase was 
observed. This suggests that once the megareplicator has been 
activated, it permits the activation and firing of "secondary" origins, and 
that the replication of the bulk of the satellite DNA takes place from these 
"secondary" origins during the second half of the S-phase. This is 
supported by the observation that in certain stages of the replication of 
the megachromosome, the whole amplicon can apparently be labelled by 
a short BrdU pulse. 

Megareplicators and secondary replication origins seem to be under 
strict temporal and spatial control. The first initiation within the 
megachromosomes usually occurred at the centromere, and that shortly 
afterward all the megareplicators become active. The last segment of the 
megachromosome to complete replication was usually the second 
segment of the long arm. Results of control experiments with mouse A9 
chromosomes indicate that replication of the heterochromatin of mouse 
chromosomes corresponds to the replication of the megachromosome 
amplicons. Therefore, the pre-existing temporal control of replication in 
the heterochromatic blocks is preserved in the megachromosome. 
Positive [Hassan etaL (1994) J. Cell. Sci. 107:425-434] and negative 
[Haase et aL (1 994) Mol. Cell. Biol. 14:251 6-2524] correlations between 
transcriptional activity and initiation of replication have been proposed. In 
the megachromosome, transcription of the integrated genes seems to 
have no effect on the original timing of the replication origins. The 
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concerted, precise timing of the megareplicator initiations in the different 
amplicons suggests the presence of specific, cis-acting sequences, origins 
of replication. 

Considering that pericentric heterochromatin of mouse 
5 chromosomes contains thousands of short, simple repeats spanning 7- 
15 Mb, and the centromere itself may also contain hundreds of kilobases, 
the existence of a higher-order unit of replication seems probable. The 
observed uncontrolled intrachromosomal amplification restricted to a 
replication initiation region of the megachromosome is highly suggestive 

10 of a rolling-circle type amplification, and provides additional evidence for 
the presence of a replication origin in this region. 

The finding that a specific replication initiation site occurs at the 
boundaries of amplicons suggests that replication might play a role in the 
amplification process. These results suggest that each amplicon of the 

15 megachromosome can be regarded as a huge megareplicon defined by a 
primary initiation site [megareplicator] containing "secondary" origins of 
replication. Fusion of replication bubbles from different origins of bi- 
directional replication [DePamphilis (1993) Ann. Rev. Biochem. 62:29-63] 
within the megareplicon could form a giant replication bubble, which 

20 would correspond to the whole megareplicon. In the light of this, the 
formation of megabase-size amplicons can be accommodated by a 
replication-directed amplification mechanism. In both H and E-type 
amplifications, intrachromosomal multiplication of the amplicons was 
observed [see, above EXAMPLES], which is consistent with the unequal 

25 sister chromatid exchange model. Induced or spontaneous unscheduled 
replication of a megareplicon in the constitutive heterochromatin may also 
form new amplicon(s) leading to the expansion of the amplification or to 
the heterochromatic polymorphism of "normal" chromosomes. The 
"restoration" of the missing segment on the long arm of the 
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megachromosome may well be the result of the re-replication of one 
amplicon limited to one strand. 

Taken together, without being bound by any theory, a replication- 
directed mechanism is a plausible explanation for the initiation of large- 
scale amplifications in the centromeric regions of mouse chromosomes, as 
well as for the de novo chromosome formations. If specific [amplificator] 
sequences play a role in promoting the amplification process, sequences 
at the primary replication initiation site [megareplicator] of the 
megareplicon are possible candidates. 

Preliminary sequence data indicates the presence of highly G + C- 
rich sequence elements less than 10 kb from the integrated heterologous 
"foreign" DNA in the megachromosome. These sequences may represent 
the non-satellite DNA flanking of the A + T-rich satellite DNA blocks. 

EXAMPLE 9 

Generation of chromosomes with amplified regions derived from mouse 
chromosome 1 

To show that the events described in EXAMPLES 2-7 are not 
unique to mouse chromosome 7 and to show that the EC7/3 cell line is 
not required for formation of these chromosomes, the experiments have 
been repeated using different initial cell lines and DNA fragments. Any cell 
or cell line should be amenable to use or can readily be determined to be 
amenable or not. 

A. Materials 

The LP1 1 cell line was produced by the "scrape-loading " 
transfection method [Fechheimer et aL (1987) Proc. Natl. Acad. Sci. 
U.S.A. 84 :8463-8467] using 25 fjg plasmid DNA for 5 x 10 6 recipient 
cells. LP1 1 cells were maintained in F-12 medium containing 3-15 //g/ml 
Puromycin [SIGMA]. 

B. Amplification in LP1 1 cells 

The large-scale amplification described in the above Examples is 
not restricted to the transformed EC3/7 cell line or to the chromosome 7 
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of mouse. In an independent transformation experiment, using a 
selectable puromycin construct pPuroTel, an LMTK cell line [LP1 1] was 
established, carrying chromosome(s) with amplified chromosome 
segments of different lengths [ ~ 1 50-600 Mb]. Cytological analysis of 
5 the LP1 1 cells indicated that the amplification occurred in the pericentric 
region of the long arm of a submetacentric chromosome formed by 
Robertsonian translocation. This chromosome arm was identified by G- 
banding as chromosome 1 . C-banding and in situ hybridization with 
mouse major satellite DNA probe showed that an E-type amplification had 

10 occurred: the newly formed region was composed of an array of 

euchromatic chromosome segments containing different amounts of 
heterochromatin. The size and C-band pattern of the amplified segments 
were heterogeneous. In several cells, the number of these amplified units 
exceeded 50; single-cell subclones of LP 1 1 cell lines, however, carry 

15 stable marker chromosomes with 10-15 segments and constant C-band 
patterns. 

EXAMPLE 10 
Purification of artificial chromosomes 
I. Cell Sorting based on base composition and size 
20 A. Cell Lines 

1B3 mouse-hamster-human hybrid cells [see, Figure 4] carrying the 
megachromosome or the truncated megachromosome were grown in F-12 
medium supplemented with 10% fetal calf serum, 1 50 /vg/ml hygromycin 
B and 400 yug/ml G418. GHB-42 [a cell line recloned from G3D5] mouse- 
25 hamster hybrid cells carrying the megachromosome and the mini- 
chromosome were cultured in F-12 medium containing 10% fetal calf 
serum, 1 50 jug/m\ hygromycin B and 400 //g/ml G41 8. The doubling time 
of both cell lines was about 24 hours. 
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B. Chromosome Isolation 

To accumulate mitotic cells, 5 /yg/ml colchicine was added for 12 
hours to the cultures. Mitotic cells were then harvested by gentle 
pipetting of the medium on the layer cells. The mitotic index obtained 
5 was 60-80%. The mitotic cells by were collected by selective 

detachment. The cells were sedimented by centrifugation of 200 g for 10 
minutes. 

Two procedures were used to prepare metaphase chromosomes 
from these cells, one based on polyamine buffer system. [Cram et aL 
10 (1 990) Methods in Cell Biology 33:377-382] and the other on modified 
hexylene glycol buffer system [Hadlaczky et aL (1982) Chromosoma 
86:643-65]. 

1 . Polyamine procedure 

In the polyamine procedure, about 10 7 mitotic cells were incubated 

15 in 10 ml hypotonic buffer (75 mM KCI, 0.2 mM spermine, 0.5 mM 

spermidine) for 10 minutes at room temperature to swell the cells. The 
cells were then centrifuged at 1 00 g for 8 minutes. The cell pellet was 
drained carefully and about 10 7 cells were resuspended in 1 ml polyamine 
buffer [15 mM Tris-HCI, 20 mM NaCI, 80 mM KC1, 2 mM EDTA, 0.5 mM 

20 EGTA, 14 mM /?-mercaptoethanol, 0.1% digitonin, 0.2 mM Spermine, 0.5 
mM spermidine]. Chromosomes were then released by gently drawing 
the cell suspension up and expelling it through a 22 G needle attached to 
a 3 ml plastic syringe. The chromosome concentration was about 1- 
3x1 0 8 chromosomes/ml. 

25 2. Hexylene glycol buffer system 

In the second procedure, about 8 x 10 6 mitotic cells were 
resuspended in 10 ml glycine-hexylene glycol buffer [100 mM glycine, 
1% hexylene glycol, pH 8.4-8.6 adjusted with saturated Ca-hydroxide 
solution] and incubated for 10 minutes at 37°C, followed by 

30 centrifugation for 10 minutes to pellet the nuclei. The supernatant was 
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centrifuged again at 200 g for 20 minutes to pellet the chromosomes. 
Chromosomes were resuspended in 1 ml isolation buffer/1 -3x1 0 8 
chromosomes. 

C. Staining of chromosomes with DNA specific dyes 

5 Subsequent to isolation, the chromosome preparation was stained 

with Hoechst 33258 at 6 /vg/ml and chromocycin A3 at 200 /vg/ml. 
Fifteen minutes prior to analysis, 25 mM Na-sulphite and 10 mM Na- 
citrate were added to the chromosome suspension. 

D. Flow sorting of chromosomes 

10 Chromosomes in suspension were passed through a dual-laser cell 

sorter [FACStar Plus and FAXStar Vantage Becton Dickinson 
Immunocytometry System] in which two lasers were set to excite the 
dyes separately, allowing a bivariate analysis of the chromosome by size 
and base-pair composition. Because of the difference between the base 

15 composition of the MACs and the other chromosomes and the resulting 
difference in interaction with the dyes, as well as size differences, the 
artificial chromosomes were separated from the other chromosomes. 

E. Storage of the Sorted Artificial Chromosomes 

The sorted chromosomes are stored in GH buffer (100 mM glycine, 
20 1% hexylene glycol pH 8.4—8.6 adjusted with saturated Ca-hydroxide 

solution [see, e.g. , Hadlaczky et aL (1 982) Chromosoma 86:643-659] for 
one day and embedded by centrifugation into agarose. The sorted 
chromosomes were centrifuged into an agarose bed and the plugs are 
stored in 500 mM EDTA at 4° C. They are stored for microinjection in 
25 30% glycerol at -20° C. 

F. Quality Control 

1 . Analysis of the purity 
The purity of the sorted chromosomes was checked by 
fluorescence in situ hybridization (FISH) with biotin labeled mouse satellite 
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DNA probe [see, Hadlaczky et al (1 991 ) Proc. Natl. Acad. Sci. U.S.A. 

88:8106-8110. Purity of the sorted chromosomes was 97 — 99%. 

2. Characteristics of the sorted chromosomes 
Pulsed field gel electrophoresis and Southern hybridization were 

carried out to determine the size distribution of the DNA content of the 

sorted artificial chromosomes. 

C. Functioning of the artificial chromosomes 

To check whether their activity is preserved, the artificial 

chromosomes are microinjected into primary cells, somatic cells and stem 

cells. 

II. Sorting of mammalian artificial chromosome containing microcells 

A. Micronucleation 

Cells were growns to 80 — 90% confluency in 4 T150 flasks. 
Colcemid was added to a final concentration of 0.06 jug/ml, and then 
incubated with the cells at 37° C for 24 hours. 

B. Enucleation 

Ten //g/ml cytochalasin B was added and the resulting microcells 
were centrifuged the at 15,000 rpm for 70 minutes at 28 — 33° C. 

C. Purification of microcells by filtration 

The microcells were purified using Swinnex filter units and 
Nucleopore filters [5//m and 3/vm]. 

D. Staining and sorting microcells 

As above, the cells were stained Hoechst and chromomycin A3 
dyes. The microcells were sorted by cell sorter to isolate the microcells 
that contain the mammalian artificial chromosome. 

E. Fusion 

The microcells that contain the artificial chromosome are fused to 
selected primary cells, somatic cells, embryonic stem cells to generate 
transgenic animals for gene therapy purposes, and other cells to deliver 
the chromosomes to the cells. 
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EXAMPLE 1 1 

Introduction of mammalian artificial chromosomes into insect cells 

Insect cells should be useful hosts for MACs, particularly for 
production of gene products for a number of reasons, including: 
5 1. A mammalian artificial chromosome provides extra genomic 

specific integration site for introduction of genes encoding proteins of 
interest [reduced chance of mutation in production system]. 

2. The large size of artificial chromosome permits megabase size 
DNA integration so that genes encoding an entire pathway leading to a 

10 protein or nonprotein of therapeutic value, such as an alkaloid [digitalis, 

morphine, taxol]. 

3. Amplification of genes encoding useful proteins can be 
accomplished in the artificial mammalian chromosome to obtain higher 
protein yields in insect cells. 

15 4. Insect cells support required post translational modifications 

(glycosylation, phosphorylation) essential for protein biological function. 

5. Insect cells do not support mammalian viruses — eliminates 
cross-contamination of product with human infectious agents. 

6. The ability to introduce chromosomes, circumvents 

20 traditional recombinant baculovirus systems for production of nutritional, 
industrial or medicinal proteins in insect cell systems. 

7 The low temperature optimum for insect cell growth (28° C) 
permits reduced energy cost of production. 

8. Serum free growth medium for insect cells will result in 

25 lower production costs. 

9. Artificial chromosome containing cells can be stored 
indefinitely at low temperature. 

10. Insect larvae will serve as biological factories for the 
production of nutritional, medicinal or industrial proteins by microinjection 

30 of fertilized insect eggs. 
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A. Demonstration that insect cells recognize mammalian promoters 

Gene constructs containing a mammalian promoter, such as CMV 
linked to DNA encoding a detectable marker gene fusions [Renilla 
luciferase gene (see, e^, U.S. Patent No. 5,292,658 for a description of 
DNA encoding the Renilla luciferase, and plasmid pTZrLuc-1, which can 
provide the starting material for construction of such vectors, see SEQ ID 
No. 10] and also including the simian virus 40 (SV40) promoter operably 
linked to the beta galactosidase gene] was introduced into the cells of 
two species Trichoplusia ni [cabbage looper] and Bombyx mori [silk 
worm]. 

After transferring the constructs into the insect cell lines either by 

electroporation or by microinjection, expression of the marker genes was 

detected after a 24 h incubation. In each case a positive result was 

obtained in the samples containing the genes which was absent in 

samples in which the genes were omitted. In addition, a /?-actin 

promoter - Renilla luciferase fusion was introduced into the T. ni and B. 

mori cells yielding light emission. Thus, mammalian promoters function to 

direct expression of these marker genes in insects. Therefore, MACs are 

candidates for expression of heterologous genes in insect cells. 

B. Construction of vectors for use in insect cells and fusion with 
mammalian cells 

1 . Transform LMTK" cells with expression vector with: 

a. B. mori /?-actin promoter— Hyg r selectable marker 
gene for insect cells, and. 

b. SV40 or CMV promoters controlling a puromycin r 
selectable marker gene for mammalian cells. 

2. Detect expression of the mammalian promoter in LMTk cells 
(puromycin r LMTk cells) 

3. Use pur r cells in fusion experiments with Bombyx and 
Trichoplusia cells, select Hyg r cells. 
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C. Insertion of the MACs into insect cells 

These experiments are designed to detect expression of a 
detectable marker gene [such as /?-gal expressed under the control of a 
mammalian promoter, such as pSV40 ] located on a MAC. Data indicate 
that /?-gal was expressed. 

Insect cells of each species are fused with mammalian cells 
containing either the mini chromosome [EC3/7C5] or the mini and the 
megachromosome [such as GHB-42, which is a cell line recloned from 
G3D5] or a cell line that carries only the megachromosome [such as 
H1 D3 or a redone therefrom]. Fusion is carried out as follows: 

1 . mammalian + insect cells (50/50%) in log phase growth are 
mixed; 

2. calcium / PEG cell fusion: (10 min — 0.5 h); 

3. heterokaryons ( + 72 h) are selected. 

The following selection conditions to select for insect cells that 
contain a MAC can be used: [+ = positive selection; - = negative 
selection]: 

1 . growth at 28° C ( + insect cells, - mammalian cells); 

2. Graces insect cell medium [SIGMA] (- mammalian cells); 

3. no exogenous C02 (- mammalian cells); and/or 

4. antibiotic selection (Hyg or G418) (+ transformed insect cells). 
Immediately following the fusion protocol, many heterokaryons 

[fusion events] are observed between the mammalian and each species of 
insect cells [up to 90% heterokaryons]. After growth [2+ weeks] on 
insect medium containing G418 and/or hygromycin at selection levels 
used for selection of transformed mammalian cells, individual colonies are 
detected growing on the fusion plates. By virtue of selection for the 
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antibiotic resistance conferred by the MAC and selection for insect cells, 
these colonies should contain MACs. 

EXAMPLE 12 

Preparation of the chromosome fragmentation vector and other vectors 
5 for targeted integration of DNA into MACs 

Fragmentation of the megachromosome, should ultimately result in 
smaller stable chromosomes that contain about 15 Mb to 50 Mb that will 
be easily manipulated for use as vectors. Vectors to effect such 
fragmentation should also aid in determination and identification of the 
10 elements required for preparation of an in vitro -produced artificial 
chromosome. 

Reduction in the size of the megachromosome can be achieved in a 

number of different ways including: stress treatment, such as by 

starvation, or cold or heat treatment; treatment with agents that 

15 destabilize the genome or nick DNA, such as BrdU, coumarin, EMS and 

others; treatment with ionizing radiation [see, e.g., Brown (1992) Curr. 

O pin. Genes Dev. 2:479-486]; and telomere-directed in vivo chromosome 

fragmentation [see, e^, Far et aL d 995) EMBO_J, 14:5444-5454]. 

A. Preparation vectors for fragmentation of the artificial 
20 chromosome and also for targeted integration of selected 

gene products 

1 . Construction of pTEMPUD 

Plasmid pTEMPUD [see, Figure 5] is a mouse homologous 
recombination "killer" vector for in vivo chromosome fragmentation, and 

25 also for inducing large-scale amplification via site specific integration. 
With reference to Figure 5, the Pstl to Sail fragment was derived from 
pBabe-puro retroviral vector [see, Morgenstern et aL (1 990) Nucleic Acids 
Res. 18 :3587-3596]. This fragment contains DNA encoding ampicillin 
resistance, the pUC origin of replication, and the puromycin N-acetyl 

30 transferase gene under control of the SV40 early promoter. The URA3 
portion comes from the pYAC5 cloning vector [SIGMA]. URA3 was cut 
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out of pYAC5 with Sall-Xhol digestion, cloned into pNEB193 [New 
England Biolabs], which was then cut with EcoRI-Sall and ligated to the 
Sail site of pBabepuro to produce pPU. 

A 1293 bp fragment [see SEQ ID No. 1] encoding the mouse major 
satellite, was isolated as an EcoRI fragment from a DNA library produced 
from mouse LMTK" fibroblast cells and inserted into the EcoRI site of pPU 

to produce pMPU. 

The TK promoter driven diphtheria toxin gene [DT-A] was derived 
from pMC1DT-A [see, Maxwell et aL (1986) Cancer Res. 46:4660-4666] 
by Bglll-Xhol digestion and cloned into the pMCIneo poly A expression 
vector [STRATAGENE, La Jolla, CA] by replacing the neo coding 
sequence. The TK promoter, DT-A gene and poly A sequence were 
removed from this vector, cohesive ends were filled with Klenow and the 
resulting fragment blunt end-ligated and ligated into the SnaB1 [TACGTA] 
of pMPU to produce pMPUD. 

The Hutel 2.5 fragment [see SEQ ID No. 3] was inserted at the Pstl 
site [see the 6100 Pstl - 3625 Pstl fragment on pTEMPUD] of pMPUD to 
produce pTEMPUD. This fragment includes a human telomere. It 
includes a unique Bglll site [see nucleotides 1042-1047 of SEQ ID No. 3], 
which will be used as a site for introduction of a synthetic telomere that 
will include multiple repeats [80] of GGGATT with BamHI and Bglll ends 
for insertion into the Bglll site which will then remain unique, since the 
BamHI overhang is compatible with the Bglll site. Ligation of BamHI 
fragment to a Bglll destroys the Bglll site, so that only a single Bglll site 
will remain. Selection for the unique Bglll site insures that the synthetic 
telomere will be inserted in the correct orientation. The unique Bglll site 
is the site at which the vector is linearized. 
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2. Use of pTEMPUD for m vivo chromosome fragmentation 

Linearization of pTEMPUD by Bglll results in a linear molecule with 
a human telomere at one end. Integration of this linear fragment into the 
chromosome, such as the megachromosome in hybrid cells or any mouse 
5 chromosome, which is contains repeats of the mouse major satellite 
sequence, results integration of the selectable marker puromycin and 
cleavage of the plasmid by virtue of the telomeric end. The DT gene 
prevents that entire linear fragment from integrating by random events, 
since upon integration and expression it is toxic. Thus random integration 
10 will be toxic. Thus, site directed integration into the targeted DNA will be 
selected. Such integration will produce fragmented chromosomes. 

The fragmented truncated chromosome with the new telomere will 
survive, and the other fragment without the centromere will be lost. 
Repeated jn vivo fragmentations will ultimately result in selection of the 
15 smallest functioning minichromosome possible. 

Thus this vector can be used to produce minichromosomes from 
mouse chromosomes, or to fragment the megachromosome. 
3. pTEMPhu and pTEMPhu3 
Vectors that specifically target human chromosomes can be 
20 constructed from pTEMPUD. These vectors can be used to fragment 
specific human chromosomes, depending upon the selected satellite 
sequence, to produce human minichromosomes, and also to isolate 
human centromeres. 

a. pTEMPhu 

25 To render pTEMPUD suitable for fragmenting human chromosomes, 

the mouse major satellite sequence is replaced with human satellite 
sequences. Unlike mouse chromosomes, each human chromosome has a 
unique satellite sequence. For example, the mouse major satellite has 
been replace with a human hexameric or-satellite [or alphoid satellite] DNA 

30 sequence. This sequence is an 81 3 bp fragment [nucleotide 232-1 044 of 
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SEQ ID No. 2] from clone pS12, deposited in the EMBL database under 
Accession number X60716, isolated from a human colon carcinoma cell 
line Colo320 [deposited under Accession No. ATCC CCL 220.1]. The 
813 bp alphoid fragment can be obtained from the pS12 clone by nucleic 
acid amplification using synthetic primers, which each contain an EcoRI 
site, as follows: 

GGGGAATTCAT TGGGATGTTT CAGTTGA forward primer [SEQ ID No. 4] 
CGAAAGTCCCC CCTAGGAGAT CTTAAGGA reverse primer [SEQ ID No. 5]. 

Digestion of the amplified product with EcoRI results in a fragment 
with EcoRI ends that includes the human a-satellite sequence. This 
sequence is inserted into pTEMPUD in place of the EcoRI fragment that 
contains the mouse major satellite. 

b. pTEMPhu3 

In pTEMPhu3, the mouse major satellite sequence is replaced by 
the 3kb human chromosome 3-specific a-satellite from D3Z1 [deposited 
under ATCC Accession No. 85434; see, also Yrokov (1989) Cytogenet. 

Cell Genet. 51 :1 114]. 

4. Use of the pTEMPHU3 to induce amplification on human 
chromosome #3 

Each human chromosome contains unique chromosome-specific 
alphoid sequence. Thus, use of pTEMPHU3, which is targeted to the 
chromosome 3-specific ar-satellite can be introduced into human cells 
under selective conditions, whereby large scale amplification of the 
chromosome 3 centromeric region and production of a de novo 
chromosome. Such induced large-scale amplification provides a means 
for inducing de novo chromosome formation and also for jn vivo cloning 
of defined human chromosome fragments up to megabase size. 

For example, the break-point in human chromosome #3 is on the 
short arm near the centromere, this region is involved in renal cell 
carcinoma formation. By targeting pTEMPhu3 to this region, the induced 
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large-scale amplification may contain this region, which can then be 

cloned using the bacterial and yeast markers in the pTEMPhu3 vector. 

The pTEMPhu3 cloning vector allows not only selection for 

homologous recombinants, but also direct cloning of the integration site in 

YACS. This vector can also be used to target human chromosome #3, 

preferably with a deleted short arm, in a mouse-human monochromosomal 

microcell hybrid line. Homologous recombinants can be screened by 

nucleic acid amplification (PCR) and amplification can be screened by 

DNA hybridization, Southern hybridization, and in situ hybridization. The 

amplified region can be cloned into YAC. This vector and these methods 

also permit a functional analysis of cloned chromosome regions by 

reintroducing the cloned amplified region into mammalian cells. 

B. Preparation of libraries in YAC vectors for cloning of centromeres 
and identification of functional chromosomal units 

Another method that may be used to obtain smaller-sized functional 
mammalian artificial chromosome units and to clone centromeric DNA 
involves screening of mammalian DNA YAC vector-based libraries and 
functional analysis of potential positive clones in a transgenic mouse 
model system. A mammaliam DNA library is prepared in a YAC vector, 
such as YRT2 [see Schedl et aL (1 993) Nuc. Acids Res. 21 :4783-4787], 
which contains the murine tyrosinase gene. The library is screened for 
hybridization to mammalian telomere and centromere sequence probes. 
Positive clones are isolated and microinjected into pronuclei of fertilized 
oocytes of NMRI/Han mice following standard techniques. The embryos 
are then transferred into NMRI/Han foster mothers. Expression of the 
tyrosinase gene in transgenic offspring confers an identifiable phenotype 
(pigmentation). The clones that give rise to tyrosinase-expressing 
transgenic mice are thus confirmed as containing functional mammalian 
artificial chromosome units. 

Alternatively, fragments of SATACs may be introduced into the 
YAC vectors and then introduced into pronuclei of fertilized oocytes of 
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NMRI/Han mice following standard techniques as above. The clones that 
give rise to tyrosinase-expressing transgenic mice are thus confirmed as 
containing functional mammalian artificial chromosome units, particularly 
centromeres. 

5 C. Incorporation of Heterologous Genes into Mammalian Artificial 
Chromosomes through The Use of Homology Targeting Vectors 

As described above, the use of mammalian artificial chromosomes 
for expression of heterologous genes obviates certain negative effects 
that may result from random integration of heterologous plasmid DNA into 
10 the recipient cell genome. An essential feature of the mammalian artificial 
chromosome that makes it a useful tool in avoiding the negative effects of 
random integration is its presence as an extra-genomic gene source in 
recipient cells. Accordingly, methods of specific, targeted incorporation 
of heterologous genes exclusively into the mammalian artificial 
15 chromosome, without extraneous random integration into the genome of 
recipient cells, are desired for heterologous gene expression from a 
mammalian artificial chromosome. 

One means of achieving site-specific integration of heterologous 
genes into artificial chromosomes is through the use of homology 
20 targeting vectors. The heterologous gene of interest in subcloned into a 
targeting vector which contains nucleic acid sequences that are 
homologous to nucleotides present in the artificial chromosome. The 
vector is then introduced into cells containing the artificial chromosome 
for specific site-directed integration into the artificial chromosome through 
25 a recombination event at sites of homology between the vector and the 
chromosome. The homology targeting vectors may also contain 
selectable markers for ease of identifying cells that have incorporated the 
vector into the artificial chromosome as well as lethal selection genes that 
are expressed only upon extraneous integration of vector into the 
30 recipient cell genome. Two exemplary homology targeting vectors, ACF-7 
and p/lCF-7-DTA, are described below. 
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1 . Construction of Vector >ICF-7 

Vector /ICF-7 contains the cystic fibrosis transmembrane 
conductance regulator [CFTR] gene as an exemplary therapeutic molecule- 
encoding nucleic acid that may be incorporated into mammalian artificial 
5 chromosomes for use in gene therapy applications. This vector, which 
also contains the puromycin-resistance gene as a selectable marker, as 
well as the Saccharomvces cerevisiae ura3 gene [orotidine-5-phosphate 
decarboxylase], was constructed in a series of steps as follows, 
a. Construction of pURA 

10 Plasmid pURA was prepared by ligating a 2.6-kb Sall/Xhol fragment 

from the yeast artificial chromosome vector pYAC5 [Sigma; see also 
Burke et aL (1 987) Science 236:806-81 2 for a description of YAC 
vectors as well as GenBank Accession no. U01086 for the complete 
sequence of pYAC5] containing the S, cerevisiae ura3 gene with a 3.3-kb 

15 Sall/ Sma l fragment of pHyg [see, e^, U.S. Patent Nos. 4,997,764, 

4,686,186 and 5,162,215,. and the description above]. Prior to ligation 
the Xho l end was treated with Klenow polymerase for blunt end ligation 
to the Sma l end of the 3.3 kb fragment of pHyyg. Thus, pURA contains 
the S, cerevisiae ura3 gene, and the E. coJi ColE1 origin of replication and 

20 the ampicillin-resistance gene. The uraE gene is included to provide a 
means to recover the integrated construct from a mammalian cell as a 
YAC clone. 

b. Construction of pUP2 

Plasmid pURA was digested with Sail and ligated to a 1 .5-kb 
25 Sail fragment of pCEPUR. Plasmid pCEPUR is produced by ligating the 1.1 
kb SnaBI- Nha l fragment of pBabe-puro [Morgenstern et aL (1 990) NucL 
Acids Res. 18:3587-3596; provided by Dr. L. Szekely (Microbiology and 
Tumorbiology Center, Karolinska Institutet, Stockholm); see, also 
Tonghua et aL (1995) Chin. Med. J. (Beijing, Engl. Ed.) 108:653-659; 
30 Couto et aL (1 994) Infect. Immun. 62:2375-2378; Dunckley et aL (1 992) 
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FEBS Lett. 296 : 1 28-34; French et aL (1 995) Anal. Biochem. 228:354- 
355; Liu et aL < 1 995) Blood 85: 1 095-1 1 03; International PCT application 
Nos. WO 9520044; WO 9500178, and WO 9419456] to the Nhel-Nrul 
fragment of pCEP4 [Invitrogen]. 

The resulting plasmid, pUP2, contains the all the elements of pURA 
plus the puromycin-resistance gene linked to the SV40 promoter and 
polyadenylation signal from pCEPUR. 

c. Construction of pUP-CFTR 

The intermediate plasmid pUP-CFTR was generated in order 
to combine the elements of pUP2 into a plasmid along with the CFTR 
gene. First, a 4.5-kb Sail fragment of pCMV-CFTR that contains the 
CFTR-encoding DNA [see, also, Riordan et aL (1 989) Science 245:1066- 
1073, U.S. Patent No. 5,240,846, and Genbank Accession no. M28668 
for the sequence of the CFTR gene] containing the CFTR gene only was 
ligated to Xho l-digested pCEP4 [Invitrogen and also described herein] in 
order to insert the CFTR gene in the multiple cloning site of the Epstein 
Barr virus-based (EBV) vector pCEP4 [Invitrogen, San Diego, CA; see also 
Yates etaL (1985) Nature 313 :812-815; see, also U.S. Patent No. 
5,468,615] between the CMV promoter and SV40 polyadenylation signal. 
The resulting plasmid was designated pCEP-CFTR. Plasmid pCEP-CFTR 
was then digested with Sail and the 5.8-kb fragment containing the CFTR 
gene flanked by the CMV promoter and SV40 polyadenylation signal was 
ligated to Sail-digested pUP2 to generate pUP-CFTR. Thus, pUP-CFTR 
contains all elements of pUP2 plus the CFTR gene linked to the CMV 
promoter and SV40 polyadenylation signal. 

d. Construction of vlCF-7 

Plasmid pUP-CFTR was then linearized by partial digestion 
with Eco RI and the 13 kb fragment containing the CFTR gene was ligated 
with EcoRI-digested Charon 4AA [see Blattner et aL (1977) Science 
1 96:1 61 ; Williams and Blattner (1 979) i Virol. 29:555 and Sambrook et 
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aL (1989) Molecular Cloning, A Laboratory Manual , Second Ed., Cold 
Spring Harbor Laboratory Press, Volume 1, Section 2.18, for descriptions 
of Charon 4A/t]. The resulting vector, ACF8, contains the Charon 4-AA 
bacteriophage left arm, the CFTR gene linked to the CMV promoter and 
SV40 polyadenylation signal, the ura3 gene, the puromycin-resistance 
gene linked to the SV40 promoter and polyadenylation signal, the 
thymidine kinase promoter [TK], the ColE1 origin of replicaton, the 
amplicillan resistance gene and the Charon 4-AA bacteriophage right arm. 
The ACF8 construct was then digested with Xho l and the resulting 27.1 
kb was ligated to the 0.4kb Xhol/EcoRI fragment of pJBP86 [described 
below], containing the SV40 polyA signal and the EcoRI-digested Charon 
4A A right arm. The resulting vector ACF-7 contains the Charon 4A A left 
arm, the CFTR encoding DNA linked to the CMV promoter and SV40 
polyA signal, the ura3 gene, the puromycin resistance gene linked to the 
SV40 promoter and polyA signal adn the Charon 4A A right arm. The A 
DNA fragments provide sequences homologous to nucleotides present in 
the exemplary artificial chromosomes. 

The vector is then introduced into cells containing the artificial 
chromosomes exemplified herein. Accordingly, when the linear ACF-7 
vector is introduced into megachromosome-carrying fusion cell lines, such 
as described herein, it will be specifically integrated into the 
megachromosome through recombination between the homologous 
bacteriophage A sequences of the vector and the artificial chromosome. 
2. Construction of Vector ylCF-7-DTA 

Vector /ICF-7-DTA also contains all the elements contained in ACF- 
7, but additionally contains a lethal selection marker, the diptheria toxin-A 
(DT-A) gene as well as the ampicillin-resistance gene and an origin of 
replication. This vector was constructed in a series of steps as follows. 
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a. Construction of pJBP86 

Plasmid pJBP86 was used in the construction of ^CF-7, above. A 
1.5-kb Sail fragment of pCEPUR containing the puromycin-resistance gene 
linked to the SV40 promoter and polyadenylation signal was ligated to 
5 Hind Ill-digested pJB8 [see, e^, Ish-Horowitz et aL ( 1 981 ) Nucleic Acids 
Res. 9:2989-2998; available from ATCC as Accession No. 37074; 
commercially available from Amersham, Arlington Heights, IL]. Prior to 
ligation the Sail ends of the 1 .5 kb fragment of pCEPUR and th4 HmdJII 
linearized pJB8 ends were treated with Klenow polymerase before 
10 ligation. The resulting vector pJBP86 contains the puromycin resistance 
gene linked to the SV40 promoter and polyA signal, the 1 .8 kb COS 
region of Charon 4-AA, the ColE1 origin of replication and the ampicillin 

resistance gene. 

b. Construction of pMEP-DTA 

15 A 1.1-kb Xhol/Sall fragment of pMCI-DT-A [see, e^, Maxwell et 

aL (1986) Cancer Res. 46:4660-4666] containing the diptheria toxin-A 
gene was ligated to Xhol-digested pMEP4 [Invitrogen, San Diego, CA] to 
generate pMEP-DTA. To produce pMC1-DT-A, the coding region of the 
DTA gene was isolated as a 800 bp Pst l Hind lll fragment from p2249-1 

20 and inserted into pMC1 neopolyA [pMC1 available from Stratagene] in 
place of the neo gene and under the control of the TK promotoer. The 
resulting construct pMC1DT-A was digested with Hind lll, the ends filled 
by Klenow and Sail linkers were ligated to produce a 1061 bp TK-DTA 
gene cassette with an Xhgl end [5'] and a Sail end containing the 270 bp 

25 TK promoter and the -790 bp DT-A fragment. This fragment was ligated 
into Xho l-digested pMEP4 . 

Plasmid pMEP-DTA thus contains the DT-A gene linked to the TK 
promoter and SV40, ColE1 origin of replication and the ampicillin- 
resistance gene. 



-94- 



6869-402A 



c. Construction of pJB83-DTA9 

Plasmid pJB8 was digested with Hindlll and C]al and ligated 
with an oligonucleotide [see SEQ ID NOs. 7 and 8 for the sense and 
antisense strands of the oligonucleotide, respectively] to generate pJB83. 
5 The oligonucleotide that was ligated to CJa I/Hind Ill-digested pJB8 
contained the recognition sites of Swa l, Pact and SrfJ restriction 
endonucleases. These sites will permit ready linearization of the p/lCF-7- 
DTA construct. 

Next, a 1 .4-kb Xho l/Sall fragment of pMEP-DTA, containing the 
10 DT-A gene was ligated to Sall-digested pJB83 to generate pJB83-DTA9. 

d. Construction of >lCF-7-DTA 

The 12-bp overhangs of /tCF-7 were removed by Mung bean 
nuclease and subsequent T4 polymerase treatments. The resulting 41.1- 
kb linear /iCF-7 vector was then ligated to pFB83-DTA9 which had been 

15 digested with Oal and treated with T4 polymerase. The resulting vector, 
^CF-7-DTA, contains all the elements of ACF-7 as well as the DT-A gene 
linked to the TK promoter and the SV40 polyadenylation signal, the 
1.8 kB Charon 4A A COS region, the ampicllin-resistance gene[from 
pJB83-DTA9] and the Col E1 origin of replication [from pJB83-DT9A]. 

20 3. pMCT-RUC, 

Plasmid pMCT-RUC [14kbp] was constructed for site-specific 
targeting of the Renilla luciferase [see, e^, U.S. Patent Nos. 5,292,658 
and 5,418,155 for a description of DNA encoding Renilla luciferase, and 
plasmid pTZrLuc-1, which can provide the starting material for 

25 construction of such vectors] gene to a mammalian chromosome. The 
relevant features of this plasmid are the Renilla luciferase gene under 
transcriptional control of the human cytomegalovirus immediate-early 
gene enhancer/promoter; the hygromycin gene under the transcriptional 
control of the thymidine kinase promoter; and a unique Hpal site is for 

30 linearizing the plasmid. 
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This construct was introduced into C5 cells [see, Lorenz et aT 
(1996) J. Biolum. Chemilum. 11:31-37]. C5 mouse fibroblasts were 
maintained as a monolayer [see, Gluzman (1 981 ) CeJI 23:1 75-1 83]. Cells 
at 50% confluency in 100 mm Petri dishes were used for calcium 
5 phosphate transfection [see, Harper et aL (1 981 ) Chromosoma 83:431- 
439] using 10/vg of linearized pMCT-RUC per plate. Colonies originating 
from single transfected cells were isolated and maintained in F-12 medium 
containing hygromycin (300 /vg/mL) and 10% fetal bovine serum. Cells 
were grown in 100 mm Petri dishes prior to the Renilla luciferase assay. 
10 The Renilla luciferase assay was performed [see, e.g., Matthews 

et aL<1977) Biochemistry 16:85-91]. Hygromycin-resistant cell lines 
obtained after transfection of mouse fibroblasts with linearized plasmid 
pMCT-RUC ["B" cell lines] were grown to 100% confluency for measure- 
ments of light emission in vivo and in vitro . Light emission was measured 
15 in vivo after about 30 generations as follows: growth medium was 
removed and replaced by 1 mL RPMI 1640 containing coelenterazine 
[1 mmol/L final concentration]. Light emission from cells was then 
visualized by placing the Petri dishes in a low light video image analyzer 
[Hamamatsu Argus-100]. An image was formed after 5 min. of photon 
20 accumulation using 100% sensitivity of the photon counting tube. For 
measuring light emission in vitro, cells were trypsinized and harvested 
from one Petri dish, pelleted, resuspended in 1 mL assay buffer [0.5 mol/L 
NACI, 1 mmol/L EDTA, 0.1 mol/L potassium phosphate, pH 7.4] and 
sonicated on ice for 10 s. Lysates were than assayed in a Turner TD-20e 
25 lukminometer for 10 s after rapid injection of 0.5 mL of 1 mmol/L 

coelenterazine, and the average value of light emission was recorded as 
LU [1 LU = 1.6 x 106 hu/s for this instrument]. 

Independent cell lines of mouse fibroblasts transfected with 
linearized plasmid pMCT-RUC showed different levels of Renilla luciferase 
30 activity. Similar differences in light emission were observed when 
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measurements were performed on lysates of the same cell lines. This 
variation in light emission was probably due to a position effect resulting 
from the random integration of plasmid pMCT-RUC into the mouse 
genome, since enrichment for site targeting of the luciferase gene was 
not performed in this experiment. 
D. Protein secretion targeting vectors 

Isolation of heterologous proteins produced intracellular^ in 
mammalian cell expression systems requires cell disruption under 
potentially harsh conditions and purification of the recombinant protein 
from cellular contaminants. The process of protein isolation may be 
greatly facilitated by secretion of the recombinant^ produced protein into 
the extracellular medium where there are fewer contaminants to remove 
during purification. Therefore, secretion targeting vectors have been 
constructed for use with the mammalian artificial chromosome system. 

A useful model vector for demonstrating production and secretion 
of heterologous protein in mammalian cells contains DNA encoding a 
readily detectable reporter protein fused to an efficient secretion signal 
that directs transport of the protein to the cell membrane and secretion of 
the protein from the cell. Vectors pLNCX-ILRUC and pLNCX-ILRUC/l, 
described below, are examples of such vectors. These vectors contain 
DNA encoding an interleukin-2 (IL2) signal peptide-Renjlja reniformis 
luciferase fusion protein. The IL-2 signal peptide [encoded by the 
sequence set forth in SEQ ID No. 9] directs secretion of the luciferase 
protein, to which it is linked, from mammalian cells. Upon secretion from 
the host mammalian cell, the IL-2 signal peptide is cleaved from the 
fusion protein to deliver mature, active, luciferase protein to the 
extracellular medium. Successful production and secretion of this 
heterologous protein can be readily detected by performing luciferase 
assays which measure the light emitted upon exposure of the medium to 
the bioluminescent luciferin substrate of the luciferase enzyme. 
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1 . Construction of Protein Secretion Vector pLNCX-ILRUC 

Vector pLNCX-ILRUC contains a human IL-2 signal peptide-FL reniformis 
fusion gene linked to the human cytomegalovirus (CMV) immediate early 
promoter for constitutive expression of the gene in mammalian cells. The 
5 construct was prepared as follows. 

a. Preparation of the IL-2 signal sequence-encoding DNA 
A 69-bp DNA fragment containing DNA encoding the human IL-2 
signal peptide was obtained through nucleic acid amplification, using 
appropriate primers for IL-2, of an HEK 293 cell line [see, e^, U.S. 
10 Patent No. 4,518,584 for an IL-2 encoding DNA; see, also SEQ ID No. 9; 
the IL-2 gene and corresponding amino acid sequence is also provided in 
the Genbank Sequence Database as accession nos. K02056 and 
J00264]. The signal peptide includes the first 20 amino acids shown in 
the translations provided in both of these Genbank entries and in SEQ ID 
15 NO. 9. The corresponding nucleotide sequence encoding the first 20 
amino acids is also provided in these entries [see, e.g., nucleotides 293- 
52 of accession no. K02056 and nucleotides 478-537 of accession no. 
J00264), as well as in SEQ ID NO. 9. The amplification primers included 
an Eco RI site [GAATTC] that for subcloning of the DNA fragment into 
20 EcoRI-digested pGEMT [Promega]. The forward primer is set forth in SEQ 
ID No. 1 1 and the sequence of the reverse primer is set forth in SEQ ID 
No. 12. 

TTTGAATTCATGTACAGGATGCAACTCCTG forward [SEQ ID No. 11] 

TTTGAATTCAGTAGGTGCACTGTTTGTGAC revserse [SEQ ID No. 12] 

25 b. Preparation of the iL. reniformis luciferase-encoding 

DNA 

The initial source of the FL reniformis luciferase gene was 
plasmid pLXSN-RUC. Vector pLXSN [see, e^, U.S. Patent Nos. 
5,324,655, 5,470,. 730, 5,468,634, 5,358,866 and Miller et aL (1 989) 
30 Biotechniaues 7:980] is a retroviral vector capable of expressing 

heterologous DNA under the transcriptional control of the retroviral LTR; i 
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also contains the neomycin-resistance gene operatively linked for 
expression to the SV40 early region promoter. The FL reniformis 
luciferase gene was obtained from plasmid pTZrLuc-1 [see, e^, U.S. 
Patent No. 5,292,658; see also the Genbank Sequence Database 
5 accession no. M63501 ; and see also Lorenz et aL (1 991 ) Proc. Natl. 

Acad. Sci. U.S.A. 88:4438-4442] and is shown as SEQ ID NO. 10. The 
0.97 kb EcoRl/ Sma l fragment of pTZrLuc-1 contains the coding region of 
the Renilla luciferase-encodig DNA. Vector pLXSN was digested with 
and ligated with the luciferase gene contained on a pLXSN-RUC, which 
10 contains the luciferase gene located operably linked to the viral LTR and 
upstream of the SV40 promoter, which directs expression of the 
neomycin-resistance gene. 

c. Fusion of DNA encoding the IL-2 Signal Peptide and 
the FL reniformis Luciferase Gene to Yield pLXSN- 

15 ILRUC 

The pGEMT vector containing the IL-2 signal peptide-encoding DNA 
described in l.a. above was digested with EcoRI, and the resulting 
fragment encoding the signal peptide was ligated to EcoRI-digested 
pLXSN-RUC. The resulting plasmid, called pLXSN-ILRUC, contains the IL- 
20 2 signal peptide-encoding DNA located immediately upstream of the FL 
reniformis gene in pLXSN-RUC. Plasmid pLXSN-ILRUC was then used as 
a template for nucleic acid amplification of the fusion gene in order to add 
a Sma l site at the 3' end of the fusion gene. The amplification product 
was subcloned into EcoRI/Smal-digested pGEMT (Promega) to generate 

25 ILRUC-pGEMT. 

d. Introduction of the Fusion Gene into a Vector 
Containing Control Elements for Expression in 
Mammalian Cells 

Plasmid ILRUC-pGEMT was digested with Kspl and Smal to 

30 release a fragment containing the IL-2 signal peptide-luciferase fusion 

gene which was ligated to Hpal-digested pLNCX. Vector pLNCX [see, 

e.g. , U.S. Patent Nos. 5,324,655 and 5,457,182; see, also Miller and 
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Rosman (1989) Biotechniaues 7:980-990] is a retroviral vector for 
expressing heterologous DNA under the control of the CMV promoter; it 
also contains the neomycin-resistance gene under the transcriptional 
control of a viral promoter. The vector resulting from the ligation reaction 
was designated pLNCX-ILRUC. Vector pLNCX-ILRUC contains the IL-2 
signal peptide-luciferase fusion gene located immediately downstream of 
the CMV promoter and upstream of the viral 3' LTR and polyadenylation 
signal in pLNCX. This arrangement provides for expression of the fusion 
gene under the control of the CMV promoter. Placement of the 
heterologous protein-encoding DNA [he,, the luciferase gene] in operative 
linkage with the IL-2 signal peptide-encoding DNA provides for expression 
of the fusion in mammalian cells transfected with the vector such that the 
heterologous protein is secreted from the host cell into the extracellular 
medium. 

2. Construction of Protein Secretion Targeting Vector pLIMCX- 
ILRUCyt 

Vector pLNCX-ILRUC may be modified so that it can be used to 
introduce IL-2 signal peptide-luciferase fusion gene into a mammalian 
artificial chromosome in a host cell. To facilitate specific incorporation of 
the pLNCX-ILRUC expression vector into a mammalian artificial 
chromosome, nucleic acid sequences that are homologous to nucleotides 
present in the artificial chromosome are added to the vector to permit site 
directed recombination. 

Exemplary artificial chromosomes described herein contain lambda 
phage DNA. Therefore, protein secretion targeting vector pLNCX-ILRUC! 
was prepared by addition of lambda phage DNA [from Charon 4A arms] 
to produce the secretion vector pLNCX-ILRUC. 
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3. Expression and Secretion of FL reniformis Luciferase from 
Mammalian Cells 

a. Expression of IL. reniformis Luciferase Using pLNCX- 
ILRUC 

Mammalian cells [LMTK" from the ATCC] were transiently 
transfected with vector pLNCX-ILRUC [~10//g] by electroporation 
[BIORAD, performed according to the manufacturer's instructions]. Stable 
transfectants produced by growth in G418 for neo selection have also 
been prepared. 

Transfectants were grown and then analyzed for expression of 
luciferase. To determine whether active luciferase was secreted from the 
transfected cells, culture media were assayed for luciferase by addition of 
coelentrazine [see, e.g. , Matthews et aL (1977) Biochemistry 16:85-91]. 

The results of these assays establish that vector pLNCX-ILRUC is 
capable of providing constitutive expression of heterologous DNA in 
mammalian host cells. Furthermore, the results demonstrate that the 
human IL-2 signal peptide is capable of directing secretion of proteins 
fused to the C-terminus of the peptide. Additionally, these data 
demonstrate that the R^ reniformis luciferase protein is a highly effective 
reporter molecule, which is stable in a mammalian cell environment, and 
forms the basis of a sensitive, facile assay for gene expression. 

b. Expression of R_, reniformis Luciferase Using pLNCX- 
ILRUCyl 

To express the IL-2 signal paptide-R. reniformis fusion gene from an 
artificial mammalian chromosome, vector pLNCX-ILRUC/1 is targeted for 
site-specific integration into an artificial mammalian chromosome through 
homologous recombination of the lambda DNA sequences contained in 
the chromosome and the vector. This is accomplished by introduction of 
pLNCX-ILRUC/4 into either a fusion cell line harboring mammalian artificial 
chromosomes or mammalian host cells that contain artificial mammalian 
chromosomes. If the vector is introduced into a fusion cell line harboring 
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the artificial chromosomes, for example through microinjection of the 
vector or transfection of the fusion cell line with the vector, the cells are 
then grown under selective conditions, Le^ artificial chromosomes, which 
have incorporated vector pLNCX-ILRUC/t, are isolated from the surviving 
5 cells, using purification procedures as described above, and then injected 
into the mammalian host cells. 

Alternatively, the mammalian host cells may first be injected with 
mammalian artificial chromosomes which have been isolated from a fusion 
cell line. The host cells are then transfected with vector pLNCX-ILRUC^ 

10 and grown. 

The recombinant host cells are then assayed for luciferase 

expression as described above. 
D. Other targeting vectors 

These vector rely on positive and negative selection to insure 
15 insertion and selection for the double recombinants. A single crossover 
results in incorporation of the DT-A, which kills the cell, double crossover 
recombinations delete the DT-1 gene. 

1. Plasmid pNEM1 contains: 

DT-A: Diphtheria toxin gene (negative selectable marker) 

20 Hyg: Hygromycin gene (positive selectable marker) 

rue: Renilla luciferase gene (non-selectable marker) 

1 : LTR-MMTV promoter 

2: TK promoter 

3: CMV promoter 

25 MMR: Homology region (plasmid pAG60) 

2. plasmid pNEM-2 and -3 are similar to pNEM 1 except for 
different negative selectable markers: 



pNEM-1 
pNEM-2 
30 pNEM-3 



diphtheria toxin gene as " — " selectable marker 
hygromycin antisense gene as " — " selectable marker 
thymidine kinase HSV-1 gene as "-" selectable marker 
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3. 



Plasmid - lambda DNA based homology. 



pNEIVM-1: 
pNEIVM-2: 



base vector 



base vector containing p5 = gene 



1 



LTR MMTV promoter 



2 



SV40 promoter 



3: CMV promoter 

4: /jTWA promoter (metallothionein gene promoter) 
— homology region (plasmid pAG60) 

A LA. and A R.A. homology regions for A left and right arms 



(A gt-WES). 



EXAMPLE 13 



Microinjection of mammalian cells with plasmid DNA 

These procedure will be used to microinject MACS into eukaryotic 
cells, including mammalian and insect cells. 

The microinjection technique is based on the use of small glass 
capillaries as a delivery system into cells and has been used for 
introduction of DNA fragments into nuclei [see, e.g., Chalfie et aL (1994) 
Science 263 :802-804]. It allows the transfer of almost any type of 
molecules, e.g. , hormones, proteins, DNA and RNA, into either the 
cytoplasm or nuclei of recipient cells This technique has no cell type 
restriction and is more efficient than other methods, including 
Ca 2+ — mediated gene transfer and liposome-mediated gene transfer. 
About 20-30% of the injected cells become successfully transformed. 

Microinjection is performed under a phase-contrast microscope. A 
glass microcapillary, prefilled with the DNA sample, is directed into a cell 
to be injected with the aid of a micromanipulator. An appropriate sample 
volume (1-10 pi) is transferred into the cell by gentle air pressure exerted 
by a transjector connected to the capillary. Recipient cells are grown on 
glass slides imprinted with numbered squares for convenient localization 
of the injected cells. 
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a. Materials and equipment 

Nunclon tissue culture dishes 35 x 10 mm, Mouse cell line EC3/7C5 
Plasmid DNA pCH110 [Pharmacia], Purified Green Florescent Protein 
(GFP) [GFPs from Aequorea and Renilla have been purified and also DNA 
5 encoding GFPs has been cloned; see, e^, Prasher et aL (1992) Gene 
111:229-233; International PCT Application No. WO 95/07463, which is 
based on U.S. application Serial No. 08/1 19,678 and U.S. application 
Serial No. 08/192,274], ZEISS Axiovert 100 microscope, Eppendorf 
transjector 5246, Eppendorf micromanipulator 5171, Eppendorf Cellocate 
10 coverslips, Eppendorf microloaders, Eppendorf femtotips and other 
standard equipment 

b. Protocol 

(1) Fibroblast cells are grown in 0 35 mm 
tissue culture dishes (37° C r 5% C0 2 ) until the cell density reaches 80% 

15 confluency. The dishes are removed from the incubator and medium to 
added to about a 5 mm depth. 

(2) The dish is placed onto the dish holder 
and the cells observed with 10 x objective; the focus is desirably above 
the cell surface. 

20 (3) Plasmid or chromosomal DNA solution 

[1 ng///l] and GFP protein solution are further purified by centrifuging the 
DNA sample at force sufficient to removed any particular debris [typically 
about 10,000 rpm for 10 minutes in a microcentrifuge]. 

(4) Two 2 fj\ of the DNA solution (1 ng///l) is 
25 loaded into a microcapillary with an Eppendorf microloader. During 

loading, the loader is inserted to the tip end of the microcapillary. GFP (1 
mg/ml) is loaded wit the same procedure. 

(5) The protecting sheath is removed from the 
microcapillary and the microcapillary is fixed onto the capillary holer 

30 connected with the micromanipulator. 
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(6) The capillary tip lowered to the surface of 
the medium and focus on the cells gradually until the tip of the capillary 
reaches the surface of a cell. Lower the capillary further so that the 
capillary is inserted into the cell. Various parameters, such as the level of 
5 the capillary, the time and pressure are determined for the particular 

equipment. For example, using the fibroblast cell line C5 and the above- 
noted equipment, the best conditions are: injection time 0.4 second, 
pressure 80 psi. DNA can then be automatically injected into the nuclei 
of the cells. 

10 (7) After injection, the cells are returned to 

the incubator, and incubated for about 18-24 hours. 

(8) After incubation the number of 
transformants can be determined by a suitable method, which depends 
upon the selection marker. For example, if green fluorescent protein is 

15 used, the assay can be performed using UV light source and fluorescent 
filter set at 0-24 hours after injection. If £-gal-containing DNA, such as 
DNA-derived from pHC1 10, has been injected, then the transformants can 
be assayed for yff-gal. 

i. Detection of /?-galactosidase in 
20 cells injected with plasmid DNA. The medium is removed from the 

culture plate and the cells are fixed by addition of 5 ml of fixation Solution 
I: (1% glutaraldehyde; 0.1 M sodium phosphate buffer, pH 7.0; 1 mM 
MgCI 2 ), and incubated for 15 minutes at 37° C. Fixation Solution I is 
replaced with 5 ml of X-gal Solution II: [0.2% X-gal, 10 mM sodium 
25 phosphate buffer (pH 7.0), 150 mM NaCI, 1 mM MgCI 2 , 3.3 mM 

K 4 Fe(CN) 6 H 2 0, 3.3 mM K 3 Fe(CN) 6 ], and the plates are incubated for 30- 
60 minutes at 37° C. 

The X-gal solution is removed and 2 ml of 70% glycerol is added to 
each dish. Blue stained cells are identified under a light microscope. 
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This will be used to introduce a MAC, particular the MAC with the 
anti-HIV megachromome, to produce a mouse model for anti-HIV activity. 

EXAMPLE 14 

Transgenic animals 

Transgenic animals can be generated that express heterologous 
genes which confer desired traits, e^, disease resistance, in the animals. 
A transgenic mouse is prepared to serve as model of a disease-resistant 
animal. Genes that encode vaccines or that encode therapeutic molecules 
can be introduced into embryos or ES cells to produce animals that 
express the gene product and thereby are resistant to or less susceptible 
to a particular disorder. 

The mammalian artificial megachromosome can be used to generate 
transgenic animals that stably express genes conferring desired traits, 
such as genes conferring resistance to pathogenic viruses. Transgenic 
mice containing a transgene encoding an anti-HIV ribozyme provide a 
useful model for the development of stable transgenic animals using these 
methods. 

1 . Development of Control Transgenic Mice Expressing Anti-HIV 
Ribozyme 

Control transgenic mice are generated in order to compare stability 
and amounts of transgene expression in mice developed using transgene 
DNA carried on a vector (control mice) with expression in mice developed 
using transgenes carried in an artificial megachromosome. 

a. Development of Control Transgenic Mice Expressing /?- 
galactosidase 

One set of control transgenic mice was generated by microinjection 
of mouse embryos with the yS-galactosidase gene alone. The 
microinjection procedure used to introduce the plasmid DNA into the 
mouse embryos is as described in Example 13, but modified for use with 
embryos [see, e^, Hogan et aL (1 994) Manipulating the Mouse Embryo, 
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A .-Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, see, especially pages 255-264 and Appendix 3]. Fertilized 
mouse embryos [Strain CB6 obtained from Charles River Co.] were 
injected with 1 ng of plasmid pCH1 10 (Pharmacia) which had been 
5 linearized by digestion with Bam HI. This plasmid contains the fi- 

galactosidase gene linked to the SV40 late promoter. The /?-galactosidase 
gene product provides a readily detectable marker for successful 
transgene expression. Furthermore, these control mice provide 
confirmation of the microinjection procedure used to introduce the 
10 plasmid into the embryos. Additionally, because the megachromosome 
that is transferred to the mouse embryos in the model system (see below) 
also contains the yff-galactosidase gene, the control transgenic mice that 
have been generated by injection of pCH1 10 into embryos serve as an 
analogous system for comparison of heterologous gene expression from a 
15 plasmid versus from a gene carried on an artifical chromosome. 

After injection, the embryos are cultured in modified HTF medium 
under 5% CO z at 37°C for one day until they divide to form two cells. 
The two-cell embryos are then implanted into surrogate mother female 
mice [for procedures see, Manipulating the Mou se Embryo, A Laboratory 
20 Manual (1 994) Hogan et aL, eds., Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, pp. 127 et seq. ]. 

b. Development of Control Transgenic Mice Expressing 
Anti-HIV Ribozyme 
One set of anti-HIV ribozyme gene-containing control transgenic 
25 mice was generated by microinjection of mouse embryos with plasmid 
pCEPUR-132 which contains three different genes: (1) DNA encoding an 
anti-HIV ribozyme, (2) the puromycin-resistance gene and (3) the 
hygromycin-resistance gene. Plasmid pCEPUR-132 was constructed by 
ligating portions of plasmid pCEP-132 containing the anti-HIV ribozyme 
30 gene (referred to as ribozyme D by Chang et aL [(1990) Clin. Biotech. 

2:23-31]; see also U.S. Patent No. 5,144,019 to Rossi et aL., particularly 
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Figure 4 of the patent) and the hygromycin-resistance gene with a portion 
of plasmid pCEPUR containing the puromycin-resistance gene. 

Plasmid pCEP-132 was constructed as follows. Vector pCEP4 
(Invitrogen, San Diego, CA; see also Yates et ak (1 985) Nature 31 3:81 2- 
5 81 5) was digested with Xho l which cleaves in the multiple cloning site 
region of the vector. This ~ 10.4-kb vector contains the hygromycin- 
resistance gene linked to the thymidine kinase gene promoter and 
polyadenylation signal, as well as the ampicillin-resistance gene and ColE1 
origin of replication and EBNA-1 (Epstein-Barr virus nuclear antigen) genes 
10 and OriP. The multiple cloning site is flanked by the cytomegalovirus 
promoter and SV40 polyadenylation signal. 

Xhol-digested pCEP4 was ligated with a fragment obtained by 
digestion of plasmid 132 (see Example 4 for a description of this plasmid) 
with Xho l and Sail. This Xhol/Sall fragment contains the anti-HIV 
15 ribozyme gene linked at the 3' end to the SV40 polyadenylation signal. 
The plasmid resulting from this ligation was designated pCEP-132. Thus, 
in effect, pCEP-132 comprises pCEP4 with the anti-HIV ribozyme gene 
and SV40 polyadenylation signal inserted in the multiple cloning site for 
CMV promoter-driven expression of the anti-HIV ribozyme gene. 
20 To generate pCEPUR-132, pCEP-132 was ligated with a fragment 

of pCEPUR. pCEPUR was prepared by ligating a 7.7-kb fragment 
generated upon Nhel/Nrul digestion of pCEP4 with a 1.1 -kb Nhel/SnaBI 
fragment of pBabe [see Morgenstern and Land (1990) Nucleic Acids Res. 
18:3587-3596 for a description of pBabe] that contains the puromycin- 
25 resistance gene linked at the 5' end to the SV40 promoter. Thus, 

pCEPUR is made up of the ampicillin-resistance and EBNA1 genes, as well 
as the ColE1 and OriP elements from pCEP4 and the puromycin-resistance 
gene from pBabe. The puromycin-resistance gene in pCEPUR is flanked 
by the SV40 promoter (from pBabe) at the 5' end and the SV40 
30 polyadenylation signal (from pCEP4) at the 3' end. 
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Plasmid pCEPUR was digested with Xho l and Sail and the fragment 
containing the puromycin-resistance gene linked at the 5' end to the 
SV40 promoter was ligated with Xhol-digested pCEP-132 to yield the 
~12.1-kb plasmid designated pCEPUR-132. Thus, pCEPUR-132, in 
5 effect, comprises pCEP-132 with puromycin-resistance gene and SV40 
promoter inserted at the Xho l site. The main elements of pCEPUR-132 
are the hygromycin-resistance gene linked to the thymidine kinase 
promoter and polyadenylation signal, the anti-HIV ribozyme gene linked to 
the CMV promoter and SV40 polyadenylation signal, and the puromycin- 
10 resistance gene linked to the SV40 promtoer and polyadenylation signal. 
The plasmid also contains the ampicillin-resistance and EBNA1 genes and 
the ColE1 origin of replication and OriP. 

Zygotes were prepared from (C57BL/6JxCBA/J) F1 female mice 
[see, e.g.. Manipulating the Mouse Embrvo. A Laboratory Manual (1994) 
15 Hogan et ah, eds., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, p. 429], which had been previously mated with a 
(C57BL/6JxCBA/J) F1 male. The male pronuclei of these F2 zygotes 
were injected [see, Manipulating the Mous e Embrvo. A Laboratory Manual 
(1994) Hogan et aL, eds.. Cold Spring Harbor Laboratory Press, Cold 
20 Spring Harbor, NY] with pCEPUR-1 32 ( ~ 3 /jg/m\), which had been 

linearized by digestion with Nrul. The injected eggs were then implanted 
in surrogate mother female mice for development into transgenic 
offspring. 

These primary carrier offspring were analyzed (as described below) 
25 for the presence of the transgene in DNA isolated from tail cells. Seven 
carrier mice that contained transgenes in their tail cells (but that may not 
carry the transgene in all their cells, i.e., they may be chimeric) were 
allowed to mate to produce non-chimeric or germ-line heterozygotes. The 
heterozygotes were, in turn, crossed to generate homozygote transgenic 
30 offspring. 
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2. Development of Model Transgenic Mice Using 
Mammalian Artificial Chromosomes 

Fertilized mouse embryos are microinjected (as described above) 

with megachromosomes (1-10 pL containing 0-1 chromosomes/pL) 

5 isolated from fusion cell line G3D5* or H1D3* (described above). The 

megachromosomes are isolated as described herein. Megachromosomes 

isolated from either cell line carry the anti-HIV ribozyme (ribozyme D) 

gene as well as the hygromycin-resistance and /?-galactosidase genes. 

The injected embryos are then developed into transgenic mice as 

10 described above. 

Alternatively, the megachromosome-containing cell line G3D5 or 
H1D3* is fused with mouse embryonic stem cells [see, e.g. , U.S. Patent 
No. 5,453,357, commerically available; see Manipulating the Mouse 
Embrvo. A I ahoratorv Manual (1 994) Hogan et aL, eds., Cold Spring 

15 Harbor Laboratory Press, Cold Spring Harbor, NY, pages 253-289] 
following standard procedures see also, e^ Guide to Techniques in 
Mouse Development in Methods in Enzvmoloqv Vol. 25, Wassarman and 
De Pamphilis, eds. (1993), pages 803-932]. (It is also possible to deliver 
isolated megachromosomes into embryonic stem cells using the Microcell 

20 procedure [such as that described above].) The stem cells are cultured in 
the presence of a fibroblast [ e.g. , STO fibroblasts that are resistant to 
hygromycin and puromycin]. Cells of the resultant fusion cell line, which 
contains megachromosomes carrying the transgenes [Le^ anti-HIV 
ribozyme, hygromycin-resistance and £-galactosidase genes], are then 

25 transplanted into mouse blastocysts, which are in turn implanted into a 
surrogate mother female mouse where development into a transgenic 

mouse will occur. 

Mice generated by this method are chimeric in that the transgenes 
will be expressed in only certain areas of the mouse, e.g., the head, and 
30 thus may not be expressed in all cells. 
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3. Analysis of Transgenic Mice for Transgene Expression 

Beginning when the transgenic mice, generated as described 
above, are three-to-four weeks old, they can be analyzed for stable 
expression of the transgenes that were transferred into the embryos [or 
5 fertilized eggs] from which they develop. The transgenic mice may be 
analyzed in several ways as follows. 

a. Analysis of Cells Obtained from the Transgenic 
Mice 

Cell samples [ e.g. , spleen cells, lymphocytes, tail cells] are 
10 obtained from the transgenic mice. Any cells may be tested for transgene 
expression. If, however, the mice are chimeras generated by 
microinjection of fertilized eggs with fusions of embryonic stem cells with 
megachromosome-containing cells, only cells from areas of the mouse 
that carry the transgene are expected to express the transgene. If the 
15 cells survive growth on hygromycin [or hygromycin and puromycin or 
neomycin, if the cells are obtained from mice generated by transfer of 
both antibiotic-resistance genes], this is one indication that they are 
stably expressing the transgenes. RNA isolated from the cells according 
to standard methods may also be analyzed by northern blot procedures to 
20 determine if the cells express transcripts that hybridize to nucleic acid 
probes based on the antibiotic-resistance genes. 

Additionally, cells obtained from the transgenic mice may also be 
analyzed for yS-galactosidase expression using standard assays for this 
marker enzyme [for example, by direct staining of the product of a 
25 reaction involving /?-galactosidase and the X-gal substrate, see, e^, 

Jones (1986) EMBO 5:31 33-31 42, or by measurement of £-galactosidase 
activity, see, e^. Miller (1972) in Experiments in Molecular Genetics pp. 
352-355, Cold Spring Harbor Press]. Analysis of £-galactosidase 
expression is particularly used to evaluate transgene expression in cells 
30 obtained from control transgenic mice in which the only transgene 
transferred into the embryo was the yS-galactosidase gene. 
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Stable expression of the anti-HIV ribozyme gene in cells obtained 
from the transgenic mice may be evaluated in several ways. First, DNA 
isolated from the cells according to standard procedures may be subjected 
to nucleic acid amplification using primers corresponding to the ribozyme 
gene sequence. If the gene is contained within the cells, an amplified 
product of pre-determined size is detected upon hybridization of the 
reaction mixture to a nucleic acid probe based on the ribozyme gene 
sequence. Furthermore, DNA isolated from the cells may be analyzed 
using Southern blot methods for hybridization to such a nucleic acid 
probe. Second, RNA isolated from the cells may be subjected to northern 
blot hybridization to determine if the cells express RNA that hybridizes to 
nucleic acid probes based on the ribozyme gene. Third, the cells may be 
analyzed for the presence of anti-HIV ribozyme activity as described, for 
example, in Chang et aL (1 990) Clin. Biotech. 2:23-31 . In this analysis, 
RNA isolated from the cells is mixed with radioactively labeled HIV gag 
target RNA which can be obtained by in vitro transcription of gag gene 
template under reaction conditions favorable to in vitro cleavage of the 
gag target, such as those described in Chang et aL (1 990) Clin. Biotech. 
2:23-31 . After the reaction has been stopped, the mixture is analyzed by 
gel electrophoresis to determine if cleavage products smaller in size than 
the whole template are detected; presence of such cleavage fragments is 
indicative of the presence of stably expressed ribozyme. 

b. Analysis of Whole Transgenic Mice 
Whole transgenic mice that have been generated by transfer of the 
anti-HIV ribozyme gene [as well as selection and marker genes] into 
embryos or fertilized eggs can additionally be analyzed for transgene 
expression by challenging the mice with infection with HIV. It is possible 
for mice to be infected with HIV upon intraperitoneal injection with 
high-producing HIV-infected U937 cells [see, e.g., Locardi et aL (1 992) X 
Virol. 66:1649-1654]. Successful infection may be confirmed by analysis 
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of DNA isolated from cells, such as peripheral blood mononuclear cells, 
obtained from transgenic mice that have been injected with HIV-infected 
human cells. The DNA of infected transgenic mice cells will contain 
HIV-specific gag. and env sequences, as demonstrated by, for example, 
nucleic acid amplification using HIV-specific primers. If the cells also 
stably express anti-HIV ribozyme, then analysis of RNA extracts of the 
cells should reveal the smaller gag. fragments arising by cleavage of the 
gag transcript by the ribozyme. 

Additionally, the transgenic mice carrying the anti-HIV ribozyme 
gene can be crossed with transgenic mice expressing human CD4 (i.e., 
the cellular receptor for HIV) [see Gillespie et aL (1 993) Mol. CeJL BioL 
13:2952-2958; Hanna et aL (1 994) Mol. CeJL BioL 14:1084-1094; and 
Yeung et aL (1994) J. Exp. Med. 180:191 1-1920, for a description of 
transgenic mice expressing human CD4]. The offspring of these crossed 
transgenic mice expressing both the CD4 and anti-HIV ribozyme 
transgenes should be more resistant to infection [as a result of a 
reduction in the levels of active HIV in the cells] than mice expressing 
CD4 alone [without expressing anti-HIV ribozyme]. 

Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended 
claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION 



(i) APPLICANT: Hadlaczky, Gyula 
Szalay, Aladar 



(ii) TITLE OF THE INVENTION: ARTIFICIAL CHROMOSOMES, USES THEREOF 
AND METHODS FOR PREPARING ARTIFICIAL CHROMOSOMES 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Brown, Martin, Haller & McClain 

(B) STREET: 1660 Union Street 

(C) CITY: San Diego 

(D) STATE : CA 

(E) COUNTRY : USA 

(F) ZIP: 92101-2926 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 10-APR-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Seidman, Stephanie L 

(B) REGISTRATION NUMBER: 33,779 

(C) REFERENCE /DOCKET NUMBER: 6869-402 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-238-0999 

(B) TELEFAX: 619-238-0062 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 




60 
120 
180 
240 
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GCCATATTTC ACGTCCTAAA ATGTGTATTT CTCGTTTNCC GTGATTTTCA GTTTTCTCGC 3 00 

CAGATTCCAG GTCCTATAAT GTGCATTTCT CATTTNNCAC GTTTTTCAGT GATTTCGTCA 36 0 

TTTTTTCAAG TCGGCAAGTG GATGTTTCTC ATTTNCCATG ATTTNCAGTT TTCTTGNAAT 42 0 

ATTCCATGTC CTACAATGAT CATTTTTAAT TTTCCACCTT TTCATTTTTC CACGCCATAT 480 

TTCATGTCCT AAAGTGTATA TTTCTCCTTT TCCGCGATTT TCAGTTTTCT CGCCATATTC 54 0 

CAGGTCCTAC AGTGTGCATT CCTCATTTTT C AC CTTTTTC ACTGATTTCG TCATTTTTCA 6 00 

AGTCGTCAAC TGGATCTTTC TAATTTTCCA TGATTTTCAG TTATCTTGTC ATATTCCATG 66 0 

TCCTACAGTG GACATTTCTA AATTTTCCAA CTTTTTCAAT TTTTCTCGAC ATATTTGACG 72 0 

TGCTAAAGTG TGTATTTCTT ATTTTCCGTG ATTTTCAGTT TTCTCGCCAT ATTCCAGGTC 780 

CTAATAGTGT GCATTTCTCA TTTTTCACGT TTTTCAGTGA TTTCGTCATT TTTTCCAGTT 84 0 

GTCAAGGGGA TGTTTCTCAT TTTCCATGAG TGTCAGTTTT CTTGCTATAT TCCATGTCCT 900 

ACAGTGACAT TTCTAAATAT TATACCTTTT TCAGTTTTTC TCACCATATT TCACGTCCTA 96 0 

AAGTATATAT TTCTCATTTT CCCTGATTTT CAGTTTCCTT GCCATATTCC AGGTCCTACA 102 0 

GTGTGCATTT CTCATTTTTC ACGTTTTTCA GTAATTTCTT CATTTTTTAA GCCCTCAAAT 108 0 

GGATGTTTCT CATTTTCCAT GATTTTCAGT TTTCTTGCCA TATACCATGT CCTACAGTGG 1140 

ACATTTCTAA ATTATCCACC TTTTTCAGTT TTTCATCGGC ACATTTCACG TCCTAAAGTG 12 00 

TGTATTTCTA ATTTTCAGTG ATTTTCAGTT TTCTCGCCAT ATTCCAGGAC CTACAGTGTG 12 6 0 

CATTTCTCAT TTTTCACGTT TTTCAGTGAA TTC 1293 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

AGGCCTATGG TGAAAAAGGA AATATCTTCC CCTGAAAACT AGACAGAAGG ATTCTCAGAA 6 0 

TCTTATTTGT GATGTGCGCC CCTCAACTAA CAGTGTTGAA GCTTTCTTTT GATAGAGCAG 120 

TTTTGAAACA CTCTTTTTGT AAAATCTGCA AGAGGATATT TGGATAGCTT TGAGGATTTC 180 

CGTTGGAAAC GGGATTGTCT TCATATAAAC CCTAGACAGA AGCATTCTCA GAAGCTTCAT 24 0 

TGGGATGTTT CAGTTGAAGT CACAGTGTTG AACAGTCCCC TTTCATAGAG CAGGTTTGAA 3 00 

ACACTCTTTT TTGTAGTATC TGGAAGTGGA CATTTGGAGC GATCTCAGGA CTGCGGTGAA 360 

AAAGGAAATA TCTTCCAATA AAAGCTAGAT AGAGGCAATG TCAGAAACCT TTTTCATGAT 42 0 

GTATCTACTC AGCTAACAGA GTTGAACCTT CCTTTGAGAG AGCAGTTTTG AAACACTCTT 480 

TTTGTGGAAT CTGCAAGTGG ATATTTGTCT AGCTTTGAGG ATTTCGTTGG GAAACGGGAT 54 0 

TACATATAAA AAGCAGACAG CAGCATTCCC AGAAACTTCT TTGTGATGTT TGCATTCAAG 600 

TCACAGAGTT GAACATTCCC TTTCATAGAG CAGGTTTGAA ACACACTTTT TGATGTATCT 66 0 

GGATGTGGAC ATTTGCAGCG CTTTCAGGCC TAAGGTGAAA AGGAAATATC TTCCCCTGAA 72 0 

AACTAGACAG AAGCATTCTC AGAAACTTAT TTGTGATGTG CGCCCTCAAC TAACAGTGTT 780 

GAAGCTTTCT TTTGATAGAG GCAGTTTTGA AACACTCTTT TGTGGAATCT GCAAGTGGAT 84 0 

ATTTGTCTAG CTTTGAGGAT TTCTTTGGAA ACGGGATTAC ATATAAAAAG CAGACAGCAG 90 0 

CATTCCCAGA ATCTTGTTTG TGATGTTTGC ATTCAAGTCA CAGAGTTGAA CATTCCCTTT 96 0 

CAGAGAGCAG GTTTGAACAC TCTTTTTATA GTATCTGGAT GTGGACATTT GGAGCGCTTT 102 0 

CAGGGGGGAT CCTCTAGAAT TCCT 1044 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTGCAGCTGG GGGTCTCCAA TCAGGCAGGG GCCCCTTACT ACTCAGATGG GGTGGCCGAG 6 0 

TAGGGGAAGG GGGTGCAGGC TGCATGAGTG GACACAGCTG TAGGACTACC TGGGGGCTGT 120 

GGATCTATGG GGGTGGGGAG AAGCCCAGTG ACAGTGCCTA GAAGAGACAA GGTGGC CTGA 18 0 

GAGGGTCTGA GGAACATAGA GCTGGCCATG TTGGGGCCAG GTCTCAAGCA GGAAGTGAGG 24 0 

AATGGGACAG GCTTGAGGAT ACTCTACTCA GTAGCCAGGA TAGCAAGGAG GGCTTGGGGT 300 

TGCTATCCTG GGGTTCAACC CCCCAGGTTG AAGGCCCTGG GGGAGATGGT C C C AGG AC AT 36 0 

ATTACAATGG ACACAGGAGG TTGGGACACC TGGAGT C AC C AAACAAAACC ATGCCAAGAG 42 0 

AGACCATGAG TAGGGGTGTC CAGTCCAGCC CTCTGACTGA GCTGCATTGT TCAAATCCAA 480 

AGGGCCCCTG CTGCCACCTA GTGGCTGATG GCATCCACAT GACCCTGGGC CACACGCGTT 54 0 

TAGGGTCTCT GTGAAGACCA AGATCCTTGT TACATTGAAC GACTCCTAAA TGAGCAGAGA 600 

TTTCCACCTA TTCGAAACAA TCACATAAAA TCCATCCTGG AAAAAGCCTG GGGGATGGCA 66 0 

CTAAGGCTAG GGATAGGGTG GGATGAAGAT TATAGTTACA GTAAGGGGTT TAGGGTTAGG 72 0 

GATCAACGTT GGTTAGGAGT TAGGGATACA GTAGGGTACC GGTAGGGTTA GGGGTTAGGG 7 80 

TTAGGGGTTA GGGTTAGGGT TAGGGTTAGG GTTAGGGTTA GGGGTTAGGG GTTAGGGTTA 840 

GGGTTAGGTT TTGGGGTGGC GTATTTTGGT CTTATACGCT GTGTTCCACT GGCAATGAAA 900 

AGAGTTCTTG TTTTTCCTTC AGCAATTTGT CATTTTTAAA AGAGTTTAGC AATTCTAACA 96 0 

GAT AT AG AC C AGCTGTGCTA TCTCATTGTG GTTTTCAATT GTAAC C AC AT TGTGGTTTCA 102 0 

ATGTGTTTAC TTGCCATCTG TAGATCTTCT TTGCGTGAGG TGTCTGTTCA GATGTGTGTG 10 80 

CATTTCTTGN NTTTNGGCTG TTTAACTTAT TGTTTAGTTT TAATAATTTT TTATATATTT 1140 

GAAGACAAAT CTTTCTCAGA TGTGTATTTG CAAATATTTC TTCAATATGA GGCTTGCTTT 12 00 

TGTCTCTAAC AAGGTCTCTT CAGAGATAAC TTAAATATAA GAAATCCACA CTGTCACTTC 126 0 

TTTTGTGTAT ATCTACCTTT TGTGTCATTT GTTAAAATTC ATTACCAAAC CCAAAGGCAG 132 0 

ATAGCTTTTC TTCTATTGTT TCTTCTAGAA ATTTGTATAG TTTTGCATTT TTAGTGTAAG 13 80 

GATGATTTTG AGTGATTATT TGTGTAAGTT GTAAAGTTTT CGTCTATATC CATATCATTT 1440 

CTTATGGTTT CCAATTAATC GTTCCCTCAC TATTTTTGGG AAAGACACAG GATAGTGGGC 1500 

TTTGTTAGAG TAGATAGGTA GCTAGACATG AACAGGAGGG GGCCTCCTGG AAAAGGGAAA 156 0 

GTCTGGGAAG GCTCACCTGG AGGACCACCA AAAATTCACA TATTAGTAGC ATCTCTAGTG 162 0 

CTGGAGTGGA TGGGCACTTG TCAATTGTGG GTAGGAGGGA AAAGAGGTCC TATGCAGAAA 1680 

GAAACTCCCT AGAACTCCTC TGAAGATGCC CCAATCATTC ACTCTGCAAT AAAAATGTCA 1740 

GAATATTGCT AGCTACATGC TGATAAGGNN AAAGGGGACA TTCTTAAGTG AAACCTGGCA 1800 

CCATAAGTAC AGATTAGGGC AGAGAAGGAC ATT C AAAAG A GGCAGGCGCA GTAGGTACAA 186 0 

ACGTGATCGC TGTCAGTGTG CCTGGGATGG CGGGAAGGAG GCTGGTGCCA GAGTGGATTC 192 0 

GTATTGATCA C C AC AC AT AT ACCTCAACCA ACAGTGAGGA GGTCCCACAA GCCTAAGTGG 1980 

GGCAAGTTGG GGAGCTAAGG CAGTAGCAGG AAAAC C AG AC AAAGAAAACA GGTGGAGACT 2 04 0 

TGAGACAGAG GCAGGAATGT GAAGAAATCC AAAATAAAAT TCCCTGCACA GGACTCTTAG 210 0 

GCTGTTTAAT GCATCGCTCA GTCCCACTCC TCCCTATTTT TCTACAATAA ACTCTTTACA 2160 

CTGTGTTTCT TTTCAATGAA GTTATCTGCC ATCTTTGTAT TGCCTCTTGG TGAAAATGTT 222 0 

TCTTCCAAGT TAAACAAGAA CTGGGACATC AGCTCTCCCC AGTAATAGCT CCGTTTCAGT 2280 

TTGAATTTAC AGAACTGATG GGCTTAATAA CTGGCGCTCT GACTTTAGTG GTGCAGGAGG 234 0 

CCGTCACACC GGGACCAAGA GTGCCCTGCC TAGTCCCCAT CTGCCCGCAG GTGGCGGCTG 24 00 

CCTCGACACT GACAGCAATA GGGTCCGGCA GTGTCCCCAG CTGCCAGCAG GGGGCGTACG 246 0 

ACGACTACAC TGTGAGCAAG AGGGCCCTGC AG 2492 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GGGGAATTCA TTGGGATGTT TCAGTTGA 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CGAAAGTCCC CCCTAGGAGA TCTTAAGGA 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CCGCTTAATA CTCTGATGAG TCCGTGAGGA CGAAACGCTC TCGCACC 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CGATTTAAAT TAATTAAGCC CGGGC 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
TAAATTTAAT TAATTCGGGC CCGTCGA 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(D) OTHER INFORMATION IL-2 signal sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATG TAC AGG ATG CAA CTC CTG TCT TGC ATT GCA CTA AGT CTT GCA CTT 
Met Tyr Arg Met Gin Leu Leu Ser Cys lie Ala Leu Ser Leu Ala Leu 

GTC ACA AAC AGT GCA CCT ACT 

Val Thr Asn Ser Ala Pro Thr 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1 . . . 942 

(D) OTHER INFORMATION: Renilla Reinformis Lucif erase 
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48 



96 



(x) PUBLICATION INFORMATION: 

PATENT NO. : 5,418,155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AGC TTA AAG ATG ACT TCG AAA GTT TAT GAT CCA GAA CAA AGG AAA CGG 
Ser Leu Lys Met Thr Ser Lys Val Tyr Asp Pro Glu Gin Arg Lys Arg 
15 10 15 

ATG ATA ACT GGT CCG CAG TGG TGG GCC AGA TGT AAA CAA ATG AAT GTT 
Met He Thr Gly Pro Gin Trp Trp Ala Arg Cys Lys Gin Met Asn Val 
20 25 30 

CTT GAT TCA TTT ATT AAT TAT TAT GAT TCA GAA AAA CAT GCA GAA AAT 144 
Leu Asp Ser Phe He Asn Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn 
35 40 45 

GCT GTT ATT TTT TTA CAT GGT AAC GCG GCC TCT TCT TAT TTA TGG CGA 192 
Ala Val He Phe Leu His Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg 
50 55 60 

CAT GTT GTG CCA CAT ATT GAG CCA GTA GCG CGG TGT ATT ATA CCA GAT 24 0 

His Val Val Pro His He Glu Pro Val Ala Arg Cys He He Pro Asp 
65 70 75 80 

CTT ATT GGT ATG GGC AAA TCA GGC AAA TCT GGT AAT GGT TCT TAT AGG 28 8 

Leu He Gly Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg 
85 90 95 

TTA CTT GAT CAT TAC AAA TAT CTT ACT GCA TGG TTG AAC TTC TTA ATT 336 
Leu Leu Asp His Tyr Lys Tyr Leu Thr Ala Trp Leu Asn Phe Leu He 
100 105 HO 

TAC CAA AGA AGA TCA TTT TTT GTC GGC CAT GAT TGG GGT GCT TGT TTG 3 84 

Tyr Gin Arg Arg Ser Phe Phe Val Gly His Asp Trp Gly Ala Cys Leu 
111 120 125 

GCA TTT CAT TAT AGC TAT GAG CAT CAA GAT AAG ATC AAA GCA ATA GTT 432 
Ala Phe His Tyr Ser Tyr Glu His Gin Asp Lys He Lys Ala He Val 
130 " 135 140 

CAC GCT GAA AGT GTA GTA GAT GTG ATT GAA TCA TGG GAT GAA TGG CCT 480 
His Ala Glu Ser Val Val Asp Val He Glu Ser Trp Asp Glu Trp Pro 
145 150 155 160 

GAT ATT GAA GAA GAT ATT GCG TTG ATC AAA TCT GAA GAA GGA GAA AAA 52 8 

Asp He Glu Glu Asp He Ala Leu He Lys Ser Glu Glu Gly Glu Lys 
165 170 175 

ATG GTT TTG GAG AAT AAC TTC TTC GTG GAA ACC ATG TTG CCA TCA AAA 576 
Met Val Leu Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys 
180 185 190 

ATC ATG AGA AAG TTA GAA CCA GAA GAA TTT GCA GCA TAT CTT GAA CCA 624 
He Met Arg Lys Leu Glu Pro Glu Glu Phe Ala Ala Tyr Leu Glu Pro 
195 200 205 

TTC AAA GAG AAA GGT GAA GTT CGT CGT CCA ACA TTA TCA TGG CCT CGT 6 72 

Phe Lys Glu Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg 
210 * ' 215 220 
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GAA ATC CCG 
Glu lie Pro 
225 

AGG AAT TAT 
Arg Asm Tyr 



TTT ATT GAA 
Phe lie Glu 



GCC AAG AAG 
Ala Lys Lys 
275 

TTT TCG CAA 
Phe Ser Gin 
290 

TTC GTT GAG 
Phe Val Glu 
305 



TTA GTA AAA 
Leu Val Lys 
230 

AAT GCT TAT 
Asn Ala Tyr 
245 

TCG GAT CCA 
Ser Asp Pro 
260 

TTT CCT AAT 

Phe Pro Asn 



GAA GAT GCA 
Glu Asp Ala 



CGA GTT CTC 
Arg Val Leu 
310 



GGT GGT AAA 
Gly Gly Lys 



CTA CGT GCA 
Leu Arg Ala 



GGA TTC TTT 

Gly Phe Phe 
265 

ACT GAA TTT 
Thr Glu Phe 
280 

CCT GAT GAA 

Pro Asp Glu 
295 

AAA AAT GAA 

Lys Asn Glu 



CCT GAC GTT 
Pro Asp Val 
235 

AGT GAT GAT 
Ser Asp Asp 
250 

TCC AAT GCT 
Ser Asn Ala 



GTC AAA GTA 
Val Lys Val 



ATG GGA AAA 
Met Gly Lys 
300 

CAA TAA 
Gin 



GTA CAA ATT 
Val Gin lie 



TTA CCA AAA 
Leu Pro Lys 
255 

ATT GTT GAA 
lie Val Glu 

270 

AAA GGT CTT 
Lys Gly Leu 
285 

TAT ATC AAA 
Tyr lie Lys 



GTT 72 0 

Val 

240 

ATG 768 
Met 



GGC 816 
Gly 



CAT 864 
His 



TCG 912 
Ser 



945 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTTGAATTC A TGTACAGGAT GCAACTCCTG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTTGAATTCA GTAGGTGCAC TGTTTGTCAC 
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WHAT IS CLAIMED: 

1. A method for producing an artificial chromosome, comprising: 

introducing a DNA fragment into a cell, wherein the DNA 
fragment comprises a selectable marker; 
5 growing the cell under selective conditions to produce cells 

that have incorporated the DNA fragment into their genomic DNA; and 

selecting a cell that comprises a satellite artificial 
chromosome [SATAC], 

2. The method of claim 1, further comprising, isolating the 
10 SATAC. 

3. A SATAC produced by the method of claim 1. 

4. The method of claim 1, wherein the DNA fragment 
comprises a sequence of nucleotides that targets the fragment to the 
heterochromatic region of a chromosome. 

15 5. The method of claim 4, wherein the targeting sequence of 

nucleotides is satellite DNA. 

6. A cell containing an artificial chromosome, wherein the 
artificial chromosome is produced by the method of claim 1. 

7. The cell of claim 6, wherein the artificial chromosome is a 
20 satellite artificial chromosome [SATAC]. 

8. An isolated substantially pure satellite artificial chromosome 
[SATAC]. 

9. The SATAC of claim 8 that is a megachromosome, 
comprising about 50 to about 450 megabases [Mb]. 

25 10. The SATAC of claim 9, comprising about 250 to about 

400 Mb. 

1 1 . The SATAC of claim 9, comprising about 1 50 to about 
200 Mb. 

1 2. The SATAC of claim 9, comprising about 90 to about 1 20 Mb. 
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13. The SATAC of claim 9, comprising about 60 to about 100 Mb. 

14. The method of claim 1, further comprising introducing a 
fragmentation vector, whereby megachromosomes in the cells of step B 
are reduced in size to resulting in cells that contain SATACs that are 

5 about 15 to about 50 Mb. 

15. The method of claim 14, further comprising selecting a cell 
that comprises a satellite artificial chromosome [SATAC] that comprises 
about 1 5 to about 50 Mb. 

16. A cell containing an artificial chromosome, wherein the 
10 artificial chromosome is produced by the method of claim 14. 

17. A cell containing an artificial chromosome, wherein the 
artificial chromosome is produced by the method of claim 15. 

18. The cell of claim 15, wherein the artificial chromosome is a 

SATAC comprising about 15 to about 50 Mb. 
15 19. An isolated substantially pure satellite artificial chromosome 

[SATAC] that comprises about 15 to about 50 Mb. 

20. The method of claim 1, further comprising isolating the 
SATAC from the cell. 

21. The method of claim 1, wherein isolation is effected by: 
20 isolating metaphase chromosomes; 

staining the chromosomes with DNA sequence-specific dyes; and 
separating the SATACs from other chromosomes in the cells by 
flow cell sorter. 

22. A method for producing an artificial chromosome, 
25 comprising: 

introducing a DNA fragment into a cell, wherein the DNA 
fragment comprises a selectable marker, 

growing the cell under selective conditions to produce cells 
that have incorporated the DNA fragment into their genomic DNA, 
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selecting from among those cells, a cell that has a 
chromosome that comprises a de novo centromere. 

23. The method of claim 22, further comprising isolating that cell 
with the chromosome that comprises the de novo centromere, and 

5 growing the cell under conditions whereby a cell with a sausage 
chromosome is produced. 

24. The method of claim 23, further comprising isolating the cell 
with the sausage chromosome; and growing the cell under conditions 
whereby a first SATAC is produced. 

10 25. The method of claim 24, further comprising: 

introducing a fragmentation vector that is targeted to the first 
SATAC; growing the cells; and selecting a cell that comprises a second 
SATAC, wherein the second SATAC is smaller than the first SATAC. 

26. The method of claim 22, wherein the selected cell has a 
15 dicentric chromosome comprising the de novo centromere. 

27. The method of claim 22, wherein the selected cell has a 
formerly dicentric chromosome and a minichromosome comprising the de 
novo centromere. 

28. The method of claim 22, wherein the selected cell has a 
20 formerly dicentric chromosome. 

29. A method for producing an artificial chromosome, comprising 

introducing a DNA fragment into a cell, wherein the DNA 
fragment comprises a selectable marker; 

growing the cell under selective conditions to produce cells 
25 that have incorporated the DNA fragment into their genomic DNA; 

selecting from among those cells a cell that has produced a 
dicentric chromosome; and 

growing that cell under selective conditions, whereby a cell 
that contains a chromosome comprising a heterochromatic arm is 
30 produced. 
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30. The method of claim 29, further comprising selecting the cell 
with the chromosome comprising the heterochromatic arm and growing it 
in the presence of an agent that destabilizes the chromosome. 

31. The method of claim 30, further comprising identifying cells 
5 that contain a heterochromatic chromosome that is about 50 to about 

400 Mb. 

32. A method for producing a transgenic animal, comprising 
introducing a satellite artificial chromosome [SATAC] into an embryonic 
cell. 

10 33. The method of claim 32, wherein the embryonic cell is a 

stem cell. 

34. The method of claim 32, wherein the embryonic cell is in an 
embryo. 

35. The method of claim 32, wherein the SATAC comprises 
15 heterologous DNA that encodes a therapeutic product. 

36. The method of claim 32, wherein the product is the cystic 
fibrosis transmembrane regulatory protein [CFTR], an anti-HIV ribozyme, 
or a tumor suppressor gene. 

37. The method of claim 32, wherein the anti-HIV ribozyme is an 
20 ant\-gag ribozyme, and the tumor suppressor gene is p53. 

38. The method of claim 32, wherein the product comprises an 
antigen that upon expression induces a immunoprotective response 
against a pathogen in the transgenic animal. 

39. The method of claim 32, wherein the product comprises a 
25 plurality of antigens that upon expression induce an immunoprotective 

response against a plurality of pathogens. 

40. The method of claim 32, wherein the transgenic animal is a 
fish, insect, reptile, amphibians, arachnid or mammal. 
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41. The method of claim 32, wherein the SATAC is introduced 
by cell fusion, microinjection, microcell fusion, electroporation, 
microprojectile bombardment or direct DNA transfer. 

42. A transgenic animal produced by the method of claim 32. 
5 43. A method of for producing a transgenic plant or animal, 

comprising: 

introducing a DNA fragment into a cell, wherein the DNA 
fragment comprises a selectable marker; 

growing the cell under selective conditions to produce cells 
10 that have incorporated the DNA fragment into their genomic DNA; and 

selecting a cell that comprises a minichromosome that is 
about 10 Mb to about 50 Mb that comprises the selectable marker and 
euchromatin; 

isolating the minichromosome and introducing it into a plant 
15 or animal cell. 

44. The method of claim 43, wherein: after selecting the cell, 
DNA encoding a gene product or products is introduced into the cell, and 
the cell is grown under selective conditions, whereby cells comprising 
minichromosomes comprising the DNA encoding the gene product(s) are 

20 produced. 

45. The method of claim 2, wherein: after selecting the cell, DNA 
encoding a gene product or products is introduced into the cell, and the 
cell is grown under selective conditions, whereby cells comprising 

SAT ACS that comprise the DNA encoding the gene product(s) are 
25 produced. 

46. A method for producing a transgenic plant, comprising 
introducing a satellite artificial chromosome [SATAC] into a plant cell; and 
culturing the cell under conditions whereby a plant is generated. 
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47. The method of claim 46, wherein the SATAC is introduced 
by protoplast fusion, microinjection, microcell fusion, electroporation, 
microprojectile bombardment or direct DNA transfer. 

48. A method for producing a gene product(s), comprising 
introducing a satellite artificial chromosome [SATAC] into a cell; and 
culturing the cell under conditions whereby the gene product(s) is (are) 
expressed. 

49. The method of claim 48, wherein the gene product is 
produced by expression of a series of genes that encode a metabolic 
pathway; and the SATAC comprises each of these genes. 

50. A method for cloning a centromere from an animal or plant, 
comprising: 

preparing a library of DNA fragments that comprise the 
genome of the plant or animal; 

introducing the each of the fragments into mammalian 
satellite artificial chromosomes [SATACs], wherein: 

each SATAC comprises a centromere from a different 
species from the selected plant or animal, and a selectable marker; 

introducing each of the SATACs into the cells and growing 
the cells under selective conditions; 

identifying cells that have a SATAC; and 

selecting from among those cells any that have a SATAC 
comprising a centromere that differs from the centromeres in the original 
SATAC. 

51 . A cell line having the identifying characteristics of any of 
TF1004G19C5, 1 9C5xHa4, H1D3 and G3D5, which have been deposited 
at the ECACC under Accession Nos. 96040926, 96040927, 96040929, 
and 96040928, respectively. 

52. A cell line, comprising a megachromosome that comprises 

about 50-400 Mb. 
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53. A cell line of claim 52, wherein the megachromosome 
comprises about 250 to about 400 Mb. 

54. A cell line of claim 52, wherein the megachromosome 
comprises about 150 to about 200 Mb. 

5 55. A cell line of claim 52, wherein the megachromosome 

comprises about 90 to about 120 Mb. 

56. A cell line of claim 52, wherein the megachromosome 
comprises about 60 to about 100 Mb. 

57. A method for gene therapy, comprising: 

10 introducing a SATAC that comprises DNA therapeutic product into 

a target cell; and 

introducing the resulting target cells into a host animal. 

58. The method of claim 57, wherein the target cells are 
lymphocytes, stem cells or nerve cells. 

15 59. The method of claim 43, wherein the minichromosome is the 

minichromosome present in the cell line EC3/7C5. 

60. The method of claim 43, wherein the chromosome is the A 
neo-chromosome in the cell line KE1 2/4. 

61. An artificial chromosome produced by the method of claim 1 
20 that comprises more euchromatin than heterochromatin. 

62. The artificial chromosome of claim 61 that is between about 
20 Mb and about 200 Mb. 

63. The artificial chromosome of claim 61 that is between about 
100 Mb and about 200 Mb. 
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ABSTRACT 

Methods for preparing cell lines that contain artificial chromosomes, 
methods for preparation of artificial chromosomes, methods for 
purification of artificial chromosomes, methods for targeted insertion of 
5 heterologous DNA into artificial chromosomes, and methods for delivery 
of the chromosomes to selected cells and tissues are provided. Also 
provided are cell lines for use in the methods, and cell lines and 
chromosomes produced by the methods. In particular, satellite artificial 
chromosomes that, except for inserted heterologous DNA, are 
10 substantially composed of heterochromatin are provided. Methods for 
use of the artificial chromosomes, including for gene therapy, production 
of gene products and production of transgenic plants and animals are also 
provided. 
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DECLARATION FOR PATENT APPLICATION 

As a below-named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to 
my name. 

I believe I am the original, first and sole inventor (if only one name is listed 
below) or an original, first and joint inventor (if plural names are listed below) of 
the subject matter which is claimed and for which a patent is sought on the 
invention entitled: 

ARTIFICIAL CHROMOSOMES, USES THEREOF AND 
METHODS FOR PREPARING ARTIFICIAL 
CHROMOSOMES 

the specification of which (check one) 
( ) is attached hereto. 

(X) was filed by an authorized person on my behalf on 

April 10. 1996 as Application Serial No. 08/629,822 . 

I hereby state that I have reviewed and understand the contents of the above- 
identified specification, including the claims as amended by any amendment 
referred to above. 

I acknowledge the duty to disclose information which is material to the 
.examination of this application in accordance with Title 37, Code of Federal 
Regulations, § 1.56(a). 

I hereby claim foreign priority benefits under Title 35, United States Code, 
§1 19(a)-(d) or §365(b) of any foreign application(s) for patent or inventor's 
certificate listed below and so identified, or §365(a) of any PCT international 
application that designated at least one country other than the United States of 
America, listed below, and I have also identified below any foreign application 
for patent or inventor's certificate or PCT international application on this 
invention filed by us or our legal representatives or assigns and having a filing 
date before that of the application on which priority is claimed. 

Priority 
Claimed 

Number Country Day/Month/Year Filed (Yes or No) 
N/A 

I hereby claim benefit under Title 35, United States Code, §1 19(e) of any 
United States provisional application(s) listed below: 

Application Serial No. Filing Date 

N/A 



-1 of 3- 



DOCKET NO. 6869-402 



I hereby claim the benefit under Title 35, United States Code, §120 of any 
United States application(s) listed below and, insofar as the subject matter of 
each of the claims of this application is not disclosed in the prior United States 
application in the manner provided by the first paragraph of Title 35, United 
States Code, §112, I acknowledge the duty to disclose material information as 
defined in Title 37, Code of Federal Regulations, § 1.56(a) which occurred 
between the filing date of the prior application and the national or PCT 
international filing date of this application: 



Application Serial No. Filing Date Status 



I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be true; 
and further that these statements were made with the knowledge that willful 
false statements and the like so made are punishable by fine or imprisonment, 
or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or 
any patent issued thereon. 

I hereby appoint the following attorneys and agents, with full power of 
substitution and revocation, to prosecute this application and to transact all 
business in the United States Patent and Trademark Office connected therewith 
and request that all correspondence and telephone calls in respect to this 
application be directed to Stephanie Seidman, 

BROWN, MARTIN, HALLER & McCLAIN, 1660 Union Street, San Diego, 
California 92101-2926: 



N/A 



PCT Application Serial No. 
N/A 



Filing Date 



Status 



Attorney 



Reg. No. 



James W. McClain 
Paula K. Schoeneck 



Neil R. Martin 
John L. Haller 



Stephanie Seidman 
Carl Brown 



33,779 
18,587 
23,088 
27,795 
24,536 
39,362 



Address for correspondence: Stephanie Seidman 



BROWN, MARTIN, HALLER & McCLAIN 

1660 Union Street 

San Diego, CA 92101-2926 
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Full name joint inventor: 
Inventor's signature: 
Date: 

Residence: 

Post Office Address: 

Citizenship: 



GYULA HADLACZKY 




HUNGARY 



H. 6723. SZEGED 
SZAMOS U.I .A. IX. 36. 
HUNGARY 



HUNGARIAN 



Full name of joint inventor: 
Inventor's signature: 
Date: 

Residence: 

Post Office Address: 

Citizenship: 



ALADAR A. SZALAY 



T 

HIGHLAND. CALIFORNIA 
7327 FAIRWOOD 



HIGHLAND, CALIFORNIA 92346 
UNITED STATES OF AMERICA AND 
CANADA 
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Attorney Docket No. 6869-402 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant(s) or Patentee(s): HADLACZKY, Gyula and SZALAY, Aldar 
Serial No. or Patent No.: 08/629,822 
Filed or Issued: April 10, 1996 

For: ARTIFICIAL CHROMOSOMES, USES THEREOF AND METHODS FOR PREPARING 
ARTIFICIAL CHROMOSOMES 

VERIFIED STATEMENT (DECLARATION) CLAIMING 
SMALL ENTITY STATUS (37 C.F.R. §§ 1.9(f) 
and 1.27(c)) - SMALL BUSINESS CONCERN 

I hereby declare that I am 

( ) the owner of the small business concern identified beiow: 
(X ) an official of the small business concern empowered to act on behalf of the 
concern identified below: 

NAME OF CONCERN AMERICAN GENE THERAPY INC. 

ADDRESS OF CONCERN 5022 1 54TH STREET, EDMONTON, CANADA T6H5P3 

I hereby declare that the above identified small business concern qualifies as a small business 
concern as defined in 13 C.F.R. §121.3-18 and reproduced in 37 C.F.R. §1.9(d), for 
purposes of paying reduced fees under Section 41(a) and (b) of Title 35, United States Code, 
in that the number of employees of the concern, including those of its affiliates, does not 
exceed 500 persons. For purposes of this statement, (1) the number of employees of the 
business concern is the average over the previous fiscal year of the concern of the persons 
employed on a full-time, part-time or temporary basis during each of the pay periods of the 
fiscal year, and (2) concerns are affiliates of each other when either, directly or indirectly, 
one concern controls or has the power to control the other, or a third party or parties 
controls or has the power to control both. 

I hereby declare that rights under contract or law have been conveyed to and remain ^th 
the small business concern identified above with regard to the invention, entitled ARTIFICIAL 
CHROMOSOMES, USES THEREOF AND METHODS FOR PREPARING ARTIFICIAL 
CHROMOSOMES by inventor(s) GYULA HADLACZKY and ALADAR SZALAY descnbed in 

( ) the specification filed herewith 

(X) application Serial No. 08/629,822, filed April 10, 1996 

( ) Patent No. , issued . . 



If the rights held by the above identified small business concern are not exclusive, each 
individual, concern or organization having rights to the invention is listed below and no 
rights to the invention are held by any person, other than the inventor, who could not quality 
as a small business concern under 37 C.F.R. § 1.9(d) or by any concern which would not 
qualify as a small business concern under 37 C.F.R. § 1.9(d) or a nonprofit organization under 
37 C.F.R. 51.9(e). 
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"NOTE: Separate verified statements are required from each named person, concern 
or organization having rights to the invention averring to their status as small entities 
(37 C.F.R. §1.27). 



Full Name 
Address 



Full Name 
Address 



Full Name 
Address 



LOMA LINDA UNIVERSITY 



Center for Molecular Biology & Gene Therapy, Mortensen Hall, Loma 

Linda. CA 90350 

( ) Individual ( ) Small Business Concern (X) Nonprofit Organization 

CHROMOS MOLECULAR SYSTEMS. INC. 

1055 West Hastings. Suite 2020. Vancouver. BC V6E2E9 

( ) Individual (X) Small Business Concern ( ) Nonprofit Organization 

THE BIOLOGICAL RESEARCH CENTER OF THE HUNGARIAN ACADEMY 

OF SCIENCES 

H-6701 Szeged, P.Q.B. 521. HUNGARY 

( ) Individual ( ) Small Business Concern (X) Nonprofit Organization 



I acknowledge the duty to file, in this application or patent, notification of any change in 
status resulting in loss of entitlement to small entity status prior to paying, or at the time of 
spaying, the earliest of the issue fee or any maintenance fee due after the date on which 
status as a small entity is no longer appropriate (37 C.F.R. § 1.28(b)). 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 1 8 of the 
United States Code, and that such willful false statements may jeopardize the validity of the 
application, any patent issuing thereon, or any patent to which this verified statement is 
directed. 



NAME OF PERSON SIGNING 

TITLE OF PERSON OTHER THAN OWNER 

ADDRESS OF PERSON SIGNING 



SIGNATURE 
DATE 



Attorney Docket No. 6869-402 



IN THF UNITED STATES PATENT AND TRADFMARK OFFICE 
Applicant(s) or Patentee(s): HADLACZKY, Gyula and SZALAY, Aladar 
Serial No. or Patent No.: 08/629,822 
Filed or Issued: April 10, 1996 

For: ARTIFICIAL CHROMOSOMES, USES THEREOF AND METHODS FOR PREPARING 
ARTIFICIAL CHROMOSOMES 

VERIFIED STATEMENT (DECLARATION) CLAIMING 
SMALL ENTITY STATUS {37 C.F.R. §§ 1.9(f) 
and 1.27(c)) - SMALL BUSINESS CONCERN 

I hereby declare that I am 

( ) the owner of the small business concern identified below: 
(x) an official of the small business concern empowered to act on behalf of the 
concern identified below: 

NAME OF CONCERN CHROMOS MOLECULAR SYSTEMS, INC. 
ADDRESS OF CONCERN 1055 WEST HASTINGS, SUITE 2020, VANCOUVER, BC 

V6E239 

I hereby declare that the above identified small business concern qualifies as a small business 
concern as defined in 1 3 C.F.R. § 1 21 .3-1 8 and reproduced in 37 C.F.R. § 1 .9(d), for 
purposes of paying reduced fees under Section 41(a) and (b) of Title 35, United States Code, 
in that the number of employees of the concern, including those of its affiliates, does not 
exceed 500 persons. For purposes of this statement, (1) the number of employees of the 
business concern is the average over the previous fiscal year of the concern of the persons 
employed on a full-time, part-time or temporary basis during each of the pay periods of the 
fiscal year, and (2) concerns are affiliates of each other when either, directly or indirectly, 
one concern controls or has the power to control the other, or a third party or parties 
controls or has the power to control both. 

I hereby declare that rights under contract or law have been conveyed to and remain with 
the small business concern identified above with regard to the invention, entitled ARTIFICIAL 
CHROMOSOMES, USES THEREOF AND METHODS FOR PREPARING ARTIFICIAL 
CHROMOSOMES by inventor(s) GYULA HADLACZKY and ALADAR SZALAY described in 

( ) the specification filed herewith 

(X) application Serial No. 08/629,822, filed April 10, 1996 

( ) Patent No. , issued 



If the rights held by the above identified small business concern are not exclusive, each 
individual, concern or organization having rights to the invention is listed below and no 
rights to the invention are held by any person, other than the inventor, who could not qualify 
as a small business concern under 37 C.F.R. § 1.9(d) or by any concern which would not 
qualify as a small business concern under 37 C.F.R. 11.9(d) or a nonprofit organization under 
37 C.F.R. §1.9(e). 



Attorney Docket No. 6869-402 



'NOTE: Separate verified statements are required from each named person, concern 
or organization having rights to the invention averring to their status as small entities 
(37 C.F.R. §1.27). 



Full Name 
Address 



Full Name 
Address 



Full Name 
Address 



LOMA LINDA UNIVERSITY 



Center for Molecular Biology & Gene Thera py. Mortensen Hall, Loma — 

I inria. CA 90350 . 

( ) Individual ( ) Small Business Concern (X) Nonprofit Organization 



AMERICAN GENE THERAPY INC. . 

5022 154TH STREET. EDMONTON. CANADA T6H5PE . 

( ) Individual (X) Small Business Concern ( ) Nonprofit Organization 

THE BIOLOGICAL RESEARCH CENTER OF THE HU NGARIAN ACADEMY 
OF SCIENCES 



H-6701 Szeaed. P.O.B. 521. HUNGARY 

( ) Individual ( ) Small Business Concern (X) Nonprofit Organization 



I acknowledge the duty to file, in this application or patent, notification of any change in 
status resulting in loss of entitlement to small entity status prior to paying, or at the time of 
paying, the earliest of the issue fee or any maintenance fee due after the date on which 
status as a small entity is no longer appropriate (37 C.F.R. § 1.28(b)). 

I hereby declare that ail statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 1 8 of the 
United States Code, and that such willful false statements may jeopardize the validity of the 
application, any patent issuing thereon, or any patent to which this verified statement is 
directed. 



NAME OF PERSON SIGNING 

TITLE OF PERSON OTHER THAN OWNER 

ADDRESS OF PERSON SIGNING 



XLoO-g) - to S3~ IaJ^o^- 'Hop ft -To 1 -. 



tec 



SIGNATURE 




PATENT APPLICATION 
Attorney Docket No. 6869-402 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: HADLACZKY, Gyuia and SZALAY, Aladar 

Serial No.: 08/629,822 

Filed: April 10, 1996 

For: ARTIFICIAL CHROMOSOMES, USES THEREOF AND METHODS FOR PREPARING ARTIFICIAL 
CHROMOSOMES 

VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS 
(37 C.F.R. §§ 1.9(f) and 1 .27(d) - NONPROFIT ORGANIZATION 

I hereby declare that I am an official empowered to act on behalf of the nonprofit organization 
identified below: 

NAME OF ORGANIZATION: THE BIOLOGICAL RESEARCH CENTER 

OF THE HUNGARIAN ACADEMY OF SCIENCES 
ADDRESS OF ORGANIZATION: H-6701 SZEGED. P.O.B. 521 

HUNGARY 

TYPE OF ORGANIZATION 

(X) University or other Institution of Higher Education 

( ) Tax Exempt under Internal Revenue Service Code (26 U.S.C. §§501 (a) and 501(c) (3)) 
' ( ) Nonprofit Scientific or Educational under Statute of State of the United States of 

America (Name of State ) (Citation of Statute ) 

( ) Would qualify as tax exempt under Internal Revenue Service Code (26 U.S.C. §§501 (a) 

and 501 (c) (3)) if located in the United States of America 
( ) Would qualify as nonprofit Scientific or Educational under Statute of State of the United 
States of America if located in the United States of America (Name of State 
) (Citation of Statute . ) 

I hereby declare that the nonprofit organization identified above qualifies as a nonprofit organization 
as defined in 37 C.F.R. §1 .9(e) for purposes of paying reduced fees under Section 41(a) and (b) of 
Title 35, United States Code, with regard to the invention entitled ARTIFICIAL CHROMOSOMES, 
USES THEREOF AND METHODS FOR PREPARING ARTIFICIAL CHROMOSOMES by inventor(s) Gyula 
Hadlaczky and Aladar Szalay described in 

(X) application Serial No. 08/629,822, filed April 10, 1996 
I hereby declare that rights under contract or law have been conveyed to and remain with the 
nonprofit organization with regard to the above-identified invention. 

If the rights held by the nonprofit organization are not exclusive, each individual, concern or 
organization having rights to the invention is listed below' and no rights to the invention are held by 
any person, other than the inventor, who could not qualify as a small business concern under 37 
C.F.R. § 1.9(d) or by any concern which would not qualify as a small business concern under 37 
C.F.R. § 1.9(d) or a nonprofit organization under 37 C.F.R. § 1.9(e). 



Attorney Docket No. 6869-402 



"NOTE: Separate verified statements are required from each named person, concern or 
organization having rights to the invention averring to their status as small entities (37 C.F.R. 
§1.27). 

Full Name Loma Linda University — 

Address Center for Molecular Biology & Gene Therapy, Mortensen Hall. Loma L inda, CA 

920350 

( ) Individual (X) Small Business Concern ( ) Nonprofit Organization 

Full Name American Gene Therapy Institute 

Address 5022 154th Street, Edmonton, Canada T6H5P3 

( ) Individual (X) Small Business Concern ( ) Nonprofit Organization 

Full Name Chromos Molecular Systems, Inc. — _ _ 

Address 1055 West Hastings. Suite 2020, Vancouver. BC V6E2E9 . 

( ) Individual (X) Small Business Concern < ) Nonprofit Organization 



I acknowledge the duty to file, in this application or patent, notification of any change in status 
resulting in loss of entitlement to small entity status prior to paying, or at the time of paying, the 
earliest of the issue fee or any maintenance fee due after the date on which status as a small entity is 
no longer appropriate (37 C.F.R. § 1.28(b)). 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code, and that 
such willful false statements may jeopardize the validity of the application, any patent issuing 
thereon, or any patent to which this verified statement is directed. 



NAME OF PERSON SIGNING 
TITLE IN ORGANIZATION 
ADDRESS OF PERSON SIGNING 



SIGNATURE 
DATE 




Prof. Pal Venetianer 



Director General 



Biological Research Center H-6726 Szeged, Temesvari kr 



62. Hungary 



V- 



zr 
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